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GROUP B STREPTOCOCCUS ANTIGENS 

5 

FIELD OF THE INVENTION 

The present invention is related to antigens, more 
particularly protein antigens of group B streptococcus (GBS) 
10 bacterial pathogen which are useful as vaccine components 
for therapy and/or prophylaxis. 

BACKGROUND OF THE INVENTION 

15 

Streptococcus are gram {+) bacteria that are differentiated 
by group specific carbohydrate antigens A through O found on 
their cell surface. Streptococcus groups are further 
distinguished by type-specific capsular polysaccharide 
20 antigens. Several serotypes have been identified for the 
Group B streptococcus (GBS) : la, lb, II, III, IV, V, VI, 
VII and VIII. GBS also contains antigenic proteins known as 
n C-proteins H (alpha, beta, gamma and delta), some of which 
have been cloned. 

25 

Although GBS is a common component of the normal human 
vaginal and colonic flora this pathogen has long been 
recognized as a major cause of neonatal sepsis and 
meningitis, late-onset meningitis in infants, postpartum 

30 endometritis as well as mastitis in dairy herds. Expectant 
mothers exposed to GBS are at risk of postpartum infection 
and may transfer the infection to their baby as the child 
passes through the birth canal. Although the organism is 
sensitive to antibiotics, the high attack rate and rapid 

35 onset of sepsis in neonates and meningitis in infants 
results in high morbidity and mortality. 
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To find a vaccine that will protect individuals from GBS 
infection, researches have turned to the type-specific 
antigens. Unfortunately these polysaccharides have proven to 
5 be poorly immunogenic in humans and are restricted to the 
particular serotype from which the polysaccharide 
originates. Further, capsular polysaccharide elicit a T 
cell independent response i.e. no IgG production. 
Consequently capsular polysaccharide antigens are unsuitable 
10 as a vaccine component for protection against GBS infection. 

Others have focused on the C-protein beta antigen which 
demonstrated immunogenic properties in mice and rabbit 
models. This protein was found to be unsuitable as a human 

15 vaccine because of its undesirable property of interacting 
with high affinity and in a non- immunogenic manner with the 
Fc region of human IgA. The C-protein alpha antigen is rare 
in type III serotypes of GBS which is the serotype 
responsible for most GBS mediated conditions and is 

20 therefore of little use as a vaccine component. 

Therefore there remains an unmet need for GBS antigens that 
may be used as vaccine components for the prophylaxis and/or 
25 therapy of GBS infection. 

SUMMARY OF THE INVENTION 

30 According to one aspect, the present invention provides an 
isolated polynucleotide encoding a polypeptide having at 
least 70% identity to a second polypeptide comprising a 
sequence selected from the group consisting of: 
SEQ ID NO: 2, SEQ ID NO: 3, SEQ ID NO: 4, SEQ ID NO: 5, 

35 SEQ ID NO: 6, SEQ ID NO: 8, SEQ ID NO: 9, SEQ ID NO:10, 
SEQ ID NO:ll, SEQ ID NO:12, SEQ ID NO:14, SEQ ID NO:15, 
SEQ ID NO: 16, SEQ ID NO: 17, SEQ ID NO: 18, SEQ ID NO: 19, 
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SEQ ID NO:20, SEQ ID NO:21, SEQ ID NO:23, SEQ ID NO:24, 

SEQ ID NO:25, SEQ ID NO:26, SEQ ID NO:28, SEQ ID NO:29, 

SEQ ID NO:30, SEQ ID NO:31, SEQ ID NO:33, SEQ ID NO:34, 

SEQ ID NO:35, SEQ ID NO:36, SEQ ID NO:38, SEQ ID NO:39, 

5 SEQ ID NO: 40, SEQ ID NO: 41 and SEQ ID NO: 44 or fragments, 
analogs or derivatives thereof. 

In other aspects, there is provided vectors comprising 
polynucleotides of the invention operably linked to an 
10 expression control region, as well as host cells transfected 
with said vectors and methods of producing polypeptides 
comprising culturing said host cells under conditions 
suitable for expression. 

15 In yet another aspect, there is provided novel polypeptides 
encoded by polynucleotides of the invention. 



BRIEF DESCRIPTION OF THE DRAWINGS 

20 

Figure la is the DNA sequence of clone 1 (SEQ ID NO :l) with 
corresponding amino acid sequences for open reading frames; 

figure lb is the amino acid sequence SEQ ID NO: 2; 

figure lc is the amino acid sequence SEQ ID NO: 3; 

25 figure Id is the amino acid sequence SEQ ID NO: 4; 

figure le is the amino acid sequence SEQ ID NO: 5; 

figure If is the amino acid sequence SEQ ID NO: 6; 

Figure 2a is the DNA sequence of clone 2 (SEQ ID NO :7) with 
30 corresponding amino acid sequences for open reading frames; 

figure 2b is the amino acid sequence SEQ ID NO: 8; 

figure 2c is the amino acid sequence SEQ ID NO: 9; 

figure 2d is the amino acid sequence SEQ ID NO: 10; 

figure 2e is the amino acid sequence SEQ ID NO: 11; 

35 figure 2f is the amino acid sequence SEQ ID NO: 12; 
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Figure 3a is the DNA sequence of clone 3 (SEQ ID NO :13) 
with corresponding amino acid sequences for open reading 



frames; 
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with corresponding amino acid sequences for open reading 
15 frames; 

figure 4b is the amino acid sequence SEQ ID NO: 23; 
figure 4c is the amino acid sequence SEQ ID NO: 24; 
figure 4d is the amino acid sequence SEQ ID NO: 25; 
figure 4e is the amino acid sequence SEQ ID NO:26; 

20 

Figure 5a is the DNA sequence of clone 5 (SEQ ID NO :27) 
with corresponding amino acid sequences for open reading 





frames; 
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figure 


7b 


is 


the 


amino 


acid 


sequence 


SEQ 


ID NO:38; 
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figure 7c is the amino acid sequence SEQ ID NO: 39; 
figure 7d is the amino acid sequence SEQ ID NO: 40; 
figure 7e is the amino acid sequence SEQ ID NO: 41; 

Figure 8 is the DNA sequence of a part of clone 7 including 
a signal sequence (SEQ ID NO :42) ; 

Figure 9 is the DNA sequence of a part of clone 7 without a 
signal sequence (SEQ ID NO :43) ; 

Figure 9a is the amino acid sequence (SEQ ID NO:44); 

Figure 10 represents the distribution of anti-GBS ELISA 
titers in sera from CD-I mice immunized with recombinant GBS 
protein corresponding to the SEQ ID NO: 39. 
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DETAILED DESCRIPTION OF THE INVENTION 



10 



15 



20 



The present invention relates to novel antigenic 
polypeptides of group B streptococcus (GBS) characterized by 
the amino acid sequence selected from the group consisting 
Of: 

SEQ ID NO: 2, SEQ ID NO: 3, SEQ ID NO: 4, SEQ ID NO: 5, 
SEQ ID NO: 6, SEQ ID NO: 8, SEQ ID NO: 9, SEQ ID NO.-10, 
SEQ ID NO:ll, SEQ ID NO:12, SEQ ID NO:14, SEQ ID NO:15, 
SEQ ID NO:16, SEQ ID NO:17, SEQ ID NO:18, SEQ ID NO:19, 
SEQ ID NO:20, SEQ ID NO:21, SEQ ID NO:23, SEQ ID NO:24, 
SEQ ID NO:25, SEQ ID NO:26, SEQ ID NO:28, SEQ ID NO:29, 
SEQ ID NO:30, SEQ ID NO:31, SEQ ID NO:33, SEQ ID NO:34, 
SEQ ID NO:35, SEQ ID NO:36, SEQ ID NO:38, SEQ ID NO:39, 
SEQ ID NO: 40, SEQ ID NO: 41 and SEQ ID NO: 44 or fragments, 
analogs or derivatives thereof. 

A preferred embodiment of the invention includes SEQ ID 
NO :39 and SEQ ID NO: 44. 

A further preferred embodiment of the invention is SEQ ID 
NO :39. 



A further preferred embodiment of the invention is SEQ ID 
25 NO : 44. 

As used herein, "fragments", "derivatives" or "analogs" of 
the polypeptides of the invention include those polypeptides 
in which one or more of the amino acid residues are 
30 substituted with a conserved or non-conserved amino acid 
residue (preferably conserved) and which may be natural or 
unnatural . 



The terms «fragments», «derivatives» or «analogues» of 
35 polypeptides of the present invention also include 

polypeptides which are modified by addition, deletion, 
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substitution of amino acids provided that the polypeptides 
retain the capacity to induce an immune response. 

By the term ^conserved amino acid» is meant a substitution 
of one or more amino acids for another in which the 
antigenic determinant (including its secondary structure and 
hydropathic nature) of a given antigen is completely or 
partially conserved in spite of the substitution. 



15 



20 



10 For example, one or more amino acid residues within the 
sequence can be substituted by another amino acid of a 
similar polarity, which acts as a functional equivalent, 
resulting in a silent alteration. Substitutes for an amino 
acid within the sequence may be selected from other members 
of the class to which the amino acid belongs. For example, 
the nonpolar (hydrophobic) amino acids include alanine, 
leucine, isoleucine, valine, proline, phenylalanine, 
tryptophan and methionine. The polar neutral amino acids 
include glycine, serine, threonine, cysteine, tyrosine, 
asparagine and glutamine. The positively charged (basic) 
amino acids include arginine, lysine and histidine. The 
negatively charged (acidic) amino acids include aspartic 
acid and glutamic acid. 

25 Preferably, derivatives and analogs of polypeptides of the 
invention will have about 70% identity with those sequences 
illustrated in the figures or fragments thereof. That is, 
70% of the residues are the same. More preferably 
polypeptides will have greater than 95% homology. In another 
preferred embodiment, derivatives and analogs of 
polypeptides of the invention will have fewer than about 20 
amino acid residue substitutions, modifications or deletions 
and more preferably less than 10. Preferred substitutions 
are those known in the art as conserved i.e. the substituted 
residues share physical or chemical properties such as 
hydrophobicity, size, charge or functional groups. 



30 



35 
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10 



Furthermore, in those situations where amino acid regions 
are found to be polymorphic, it may be desirable to vary one 
or more particular amino acids to more effectively mimic the 
5 different epitopes of the different GBS strains. 

Also included are polypeptides which have fused thereto 
other compounds which alter the polypeptides biological or 
pharmacological properties i.e. polyethylene glycol (PEG) to 
increase half -life; leader or secretory amino acid sequences 
for ease of purification; prepro- and pro- sequences; and 
(poly) saccharides . 

Moreover, the polypeptides of the present invention can be 
15 modified by terminal -NH 2 acylation (eg. by acetylation, or 
thioglycolic acid amidation, terminal carbosy amidation, 
e.g. with ammonia or methylamine) to provide stability, 
increased hydrophobicity for linking or binding to a support 
or other molecule. 

20 

Also contemplated are hetero and homo polypeptide multimers 
of the polypeptide fragments, analogues and derivatives. 
These polymeric forms include, for example, one or more 
polypeptides that have been cross-linked with cross-linkers 
25 such as avidin/biotin, gluteraldehyde or dimethyl - 
superimidate . Such polymeric forms also include 
polypeptides containing two or more tandem or inverted 
contiguous sequences, produced from multicistronic mRNAs 
generated by recombinant DNA technology. 
Preferably, a fragment, analog or derivative of a 
polypeptide of the invention will comprise at least one 
antigenic region i.e. at least one epitope. 

In order to achieve the formation of antigenic polymers 
35 (i.e. synthetic multimers), polypeptides may be utilized 

having bishaloacetyl groups, nitroarylhalides , or the like, 

8 
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where the reagents being specific for thio groups. 
Therefore, the link between two mercapto groups of the 
different peptides may be a single bond or may be composed 
of a linking group of at least two, typically at least four, 
5 and not more than 16, but usually not more than about 14 
carbon atoms. 

In a particular embodiment, polypeptide fragments, analogs 
and derivatives of the invention do not contain a methionine 

10 (Met) starting residue. Preferably, polypeptides will not 
incorporate a leader or secretory sequence (signal 
sequence) . The signal portion of a polypeptide of the 
invention may be determined according to established 
molecular biological techniques. In general, the 

15 polypeptide of interest may be isolated from a GBS culture 
and subsequently sequenced to determine the initial residue 
of the mature protein and therefor the sequence of the 
mature polypeptide. 

20 According to another aspect, there is provided vaccine 

compositions comprising one or more GBS polypeptides of the 
invention in admixture with a pharmaceutical^ acceptable 
carrier diluent or adjuvant. 

25 Suitable adjuvants include oils i.e. Freund's complete or 
incomplete adjuvant; salts i.e. AlK(SOJ 2/ AlNa(SOJ 2 , 
AlNH 4 {S0 4 ) 2 , Al(OH) 3 , AlP0 4 , silica, kaolin; saponin 
derivative; carbon polynucleotides i.e. poly IC and poly AU 
and also detoxified cholera toxin (CTB)and E.coli heat 

30 labile toxin for induction of mucosal immunity. Preferred 
adjuvants include QuilA™, Alhydrogel™ and Adjuphos™. 
Vaccines of the invention may be administered parenterally 
by injection, rapid infusion, nasopharyngeal absorption, 
dermoabsorption, or bucal or oral. 
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Vaccine compositions of the invention are used for the 
treatment or prophylaxis of streptococcus infection and/or 
diseases and symptoms mediated by streptococcus infection, 
5 in particular group A streptococcus {pyogenes) , group B 
streptococcus (GBS or agalactiae) , dysgalactiae, uteris, 
nocardia as well as Staphylococcus aureus. General 
information about Streptococcus is available in Manual of 
Clinical Microbiology by P.R.Murray et 31.(1995, 6 th Edition, 

10 ASM Press, Washington, D.C.). More particularly group B 
streptococcus, agalactiae. In a particular embodiment 
vaccines are administered to those individuals at risk of 
GBS infection such as pregnant women and infants for sepsis, 
meningitis and pneumonia as well as immunocompromised 

15 individuals such as those with diabetes, liver disease or 

cancer. Vaccines may also have veterinary applications such 
as for the treatment of mastitis in cattle which is mediated 
by the above mentioned bacteria as well as E.coli. 

20 The vaccine of the present invention can also be used for 
the manufacture of a medicament used for the treatment or 
prophylaxis of streptococcus infection and/or diseases and 
symptoms mediated by streptococcus infection, in particular 
group A streptococcus (pyogenes) , group B streptococcus (GBS 

25 or agalactiae) , dysgalactiae, uteris, nocardia as well as 
Staphylococcus aureus. More particularly group B 
streptococcus, agalactiae . 

Vaccine compositions are preferably in unit dosage form of 
30 about 0.001 to 100 ^ig/kg (antigen/body weight) and more 
preferably 0.01 to 10 ng/kg and most preferably 0.1 to 1 
Hg/kg 1 to 3 times with an .interval of about 1 to 12 weeks 
intervals between immunizations, and more preferably 1 to 6 

10 
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weeks . 



10 



According to another aspect, there is provided 
polynucleotides encoding polypeptides of group B 
streptococcus (GBS) characterized by the amino acid sequence 
selected from the group consisting of: 

SEQ ID NO: 2, SEQ ID NO: 3, SEQ ID NO: 4, SEQ ID NO: 5, 
SEQ ID NO: 6, SEQ ID NO: 8, SEQ ID NO: 9, SEQ ID NO:10, 
SEQ ID NO:ll, SEQ ID NO:12, SEQ ID NO:14, SEQ ID NO:15, 
SEQ ID NO:16, SEQ ID NO:17, SEQ ID NO:18, SEQ ID NO:19, 
SEQ ID NO:20, SEQ ID NO:21, SEQ ID NO:23, SEQ ID NO:24, 
SEQ ID NO:25, SEQ ID NO:26, SEQ ID NO:28, SEQ ID NO:29, 
SEQ ID NO:30, SEQ ID NO:31, SEQ ID NO:33, SEQ ID NO:34, 
SEQ ID NO: 35, SEQ ID NO: 36, SEQ ID NO: 38, SEQ ID NO: 39, 
15 SEQ ID NO:40, SEQ ID NO:41 and SEQ ID NO:44 or fragments, 
analogs or derivatives thereof. 

Preferred polynucleotides are those illustrated in figures 
la (SEQ ID NO: 1), 2a (SEQ ID NO: 7), 3a (SEQ ID NO: 13), 4a 
20 (SEQ ID NO: 22), 5a (SEQ ID NO: 27), 6a (SEQ ID NO: 32), 7a 
(SEQ ID NO: 37), 8 (SEQ ID NO : 42) and 9 (SEQ ID .NO : 43) 
which correspond to the' open reading frames, encoding 
polypeptides of the invention. 

25 Preferred polynucleotides are those illustrated in figures 

la (SEQ ID NO: 1), 2a (SEQ ID NO: 7), 3a (SEQ ID NO: 13), 4a 
(SEQ ID NO: 22), 5a (SEQ ID NO: 27), 6a (SEQ ID NO: 32), 7a 
(SEQ ID NO: 37), 8 (SEQ ID NO : 42) and 9 (SEQ ID NO : 43) 
and fragments, analogues and derivatives thereof. 



30 



35 



More preferred polynucleotides of the invention are those 
illustrated in Figures 7 (SEQ ID NO : 37) , 8 (SEQ ID NO : 
42) and 9 (SEQ ID NO : 43) . 

Most preferred polynucleotides of the invention are those 
illustrated in Figures 8 (SEQ ID NO : 42) and 9 (SEQ ID NO 
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43) . 

It will be appreciated that the polynucleotide sequences 
illustrated in the figures may be altered with degenerate 
5 codons yet still encode the polypeptides of the invention. 

Due to the degeneracy of nucleotide coding sequences, other 
polynucleotide sequences which encode for substantially the 
same polypeptides of the present invention may be used in 
10 the practice of the present invention. These include but are 
not limited to nucleotide sequences which are altered by the 
substitution of different codons that encode the same amino 
acid residue within the sequence, thus producing a silent 
change . 

15 

Accordingly the present invention further provides 
polynucleotides which hybridize to the polynucleotide 
sequences herein above described (or the complement 
sequences thereof) having 50% and preferably at least 70% 
20 identity between sequences. More preferably polynucleotides 
are hybridizable under stringent conditions i.e. having at 
least 95% identity and most preferably more than 97% 
identity. 

25 By capable of hybridizing under stringent conditions is 
meant annealing of a nucleic acid molecule to at least a 
region of a second nucleic acid sequence (whether as cDNA, 
mRNA, or genomic DNA) or to its complementary strand under 
standard conditions, e.g. high temperature and/or low salt 

30 content, which tend to disfavor hybridization of 

noncomplementary nucleotide sequences. A suitable protocol 
is described in Maniatis T. et al. # Molecular cloning : A 
Laboratory Manual, Cold Springs Harbor Laboratory, 1982, 
which is herein incorporated by reference. 

35 

In a further aspect, polynucleotides encoding polypeptides 

12 
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of the invention, or fragments, analogs or derivatives 
thereof, may be used in a DNA immunization method. 
That is, they can be incorporated into a vector which is 
replicable and expressible upon injection thereby producing 
5 the antigenic polypeptide in vivo. For example 

polynucleotides may be incorporated into a plasmid vector 
under the control of the CMV promoter which is functional in 
eukaryotic cells. Preferably the vector is injected 
intramuscularly . 

10 

According to another aspect, there is provided a process for 
producing polypeptides of the invention by recombinant 
techniques by expressing a polynucleotide encoding said 
polypeptide in a host cell and recovering the expressed 
15 polypeptide product. Alternatively, the polypeptides can be 
produced according to established synthetic chemical 
techniques i.e. solution phase or solid phase synthesis of 
oligopeptides which are ligated to produce the full 
polypeptide (block ligation) . 

20 

For recombinant production, host cells are transfected with 
vectors which encode the polypeptide, and then cultured in a 
nutrient media modified as appropriate for activating 
promoters, selecting transf ormants or. amplifying the genes. 

25 Suitable vectors are those that are viable and replicable in 
the chosen host and include chromosomal, non- chromosomal and 
synthetic DNA sequences e.g. bacterial plasmids, phage DNA, 
baculovirus, yeast plasmids, vectors derived from 
combinations of plasmids and phage DNA. The polypeptide 

30 sequence may be incorporated in the vector at the 

appropriate site using restriction enzymes such that it is 
operably linked to an expression control region comprising a 
promoter, ribosome binding site (consensus region or Shine- 
Dalgarno sequence), and optionally an operator (control 

35 element) . One can select individual components of the 

expression control region that are appropriate for a given 
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host and vector according to established molecular biology 
principles (Sambrook et al, Molecular Cloning: A Laboratory 
Manual, 2nd ed. , Cold Spring Harbor, N.Y., 1989 incorporated 
herein by reference) . Suitable promoters include but are not 
5 limited to LTR or SV40 promoter, E.coli lac, tac or trp 
promoters and the phage lambda P L promoter. Vectors will 
preferably incorporate an origin of replication as well as 
selection markers i.e. ampicillin resistance gene. Suitable 
bacterial vectors include pET, pQE70, pQE60, pQE-9, pbs, 

10 pDIO phagescript, psiX174, pbluescript SK, pbsks, pNH8A, 

pNH16a, pNH18A, pNH46A, ptrc99a, pKK223-3, pKK233-3, pDR540, 
pRIT5 and eukaryotic vectors pBlueBacIII, pWLNEO, pSV2CAT, 
pOG44, pXTl, pSG, pSVK3, pBPV, pMSG and pSVL. Host cells 
may be bacterial i.e. E.coli, Bacillus subtilis, 

15 Strep tomyces; fungal i.e. Aspergillus niger, Aspergillus 
nidulins; yeast i.e. Saccharomyces or eukaryotic i.e. CHO, 
COS. 

Upon expression of the polypeptide in culture, cells are 
20 typically harvested by centrifugation then disrupted by 

physical or chemical means (if the expressed polypeptide is 
not secreted into the media) and the resulting crude extract 
retained to isolate the polypeptide of interest. 
Purification of the polypeptide from culture media or lysate 
25 may be achieved by established techniques depending on the 
properties of the polypeptide i.e. using ammonium sulfate or 
ethanol precipitation , acid extraction, anion or cation 
exchange chromatography, phosphocellulose chromatography, 
hydrophobic interaction chromatography, hydroxyl apatite 
30 chromatography and lectin chromatography. Final 
purification may be achieved using HPLC. 

The polypeptide may be expressed with or without a leader or 
secretion sequence. In the former case the leader may be 
35 removed using post-translational processing (see US 
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4,431,739; 4,425,437; and 4,338,397 incorporated herein by 
reference) or be chemically removed subsequent to purifying 
the expressed polypeptide. 

5 According to a further aspect, the GBS polypeptides of the 
invention may be used in a diagnostic test for streptococcus 
infection in particular GBS infection. Several diagnostic 
methods are possible, for example detecting streptococcus 
organism in a biological sample, the following procedure may 
10 be followed: 

a) obtaining a biological sample from a patient; 

b) incubating an antibody or fragment thereof reactive 
with a GBS polypeptide of the invention with the 
biological sample to form a mixture; and 

15 c) detecting specifically bound antibody or bound fragment 
in the mixture which indicates the presence of 
streptococcus . 

Alternatively, a method for the detection of antibody 
20 specific to a streptococcus antigen in a biological sample 
containing or suspected of containing said antibody may be 
performed as follows: 

a) isolating a biological sample from a patient; 

b) incubating one or more GBS polypeptides of the 

25 invention or fragments thereof with the biological 

sample to form a mixture; and 

c) detecting specifically bound antigen or bound fragment 
in the mixture which indicates the presence of antibody 
specific to streptococcus. 

30 

One of skill in the art will recognize that this diagnostic 
test may take several forms, including an immunological test 
such as an enzyme-linked immunosorbent assay (EL ISA) , a 
radioimmunoassay or a latex agglutination assay, essentially 
35 to determine whether antibodies specific for the protein are 
present in an organism. 
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The DNA sequences encoding polypeptides of the invention may 
also be used to design DNA probes for use in detecting the 
presence of streptococcus in a biological sample suspected 
5 of containing such bacteria. The detection method of this 
invention comprises : 

a) isolating the biological sample from a patient; 

b) incubating one or more DNA probes having a DNA sequence 
encoding a polypeptide of the invention or fragments 

10 thereof with the biological sample to form a mixture; 

and 

c) detecting specifically bound DNA probe in the mixture 
which indicates the presence of streptococcus bacteria. 

15 The DNA probes of this invention may also be used for 

detecting circulating streptococcus i.e. GBS nucleic acids 
in a sample, for example using a polymerase chain reaction, 
as a method of diagnosing streptococcus infections. The 
probe may be synthesized using conventional techniques and 

20 may be immobilized on a solid phase, or may be labeled with 
a detectable label. A preferred DNA probe for this 
application is an oligomer having a sequence complementary 
to at least about 6 contiguous nucleotides of the GBS 
polypeptides of the invention. 

25 

Another diagnostic method for the detection of streptococcus 
in a patient comprises: 

a) labeling an antibody reactive with a polypeptide of the 

invention or fragment thereof with a detectable label; 
30 b) administering the labeled antibody or labeled fragment 

to the patient; and 
c) detecting specifically bound labeled antibody or 

labeled fragment in the patient which indicates the 

presence of streptococcus. 

35 

A further aspect of the invention is the use of the GBS 
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polypeptides of the invention as immunogens for the 
production of specific antibodies for the diagnosis and in 
particular the treatment of streptococcus infection. 
Suitable antibodies may be determined using appropriate 

5 screening methods, for example by measuring the ability of a 
particular antibody to passively protect against 
streptococcus infection in a test model. One example of an 
animal model is the mouse model described in the examples 
herein. The antibody may be a whole antibody or an antigen- 

10 binding fragment thereof and may in general belong to any 
immunoglobulin class. The antibody or fragment may be of 
animal origin, specifically of mammalian origin and more 
specifically of murine, rat or human origin. It may be a 
natural antibody or a fragment thereof, or if desired, a 

15 recombinant antibody or antibody fragment. The term 

recombinant antibody or antibody fragment means antibody or 
antibody fragment which were produced using molecular 
biology techniques. The antibody or antibody fragments may 
be polyclonal, or preferably monoclonal. It may be specific 

20 for a number of epitopes associated with the GBS 
polypeptides but is preferably specific for one. 

EXAMPLE 1 Murine model of lethal Group B Streptococcus (GBS) 
25 infection 

The mouse model of GBS infection is described in detail in 
Lancefield et al (J Exp Med 142:165-179,1975). GBS strain 
C388/90 (Clinical isolate obtained in 1990 from the 

30 cephalorachidian fluid of a patient suffering from 

meningitis, Children's Hospital of Eastern Ontario, Ottawa, 
Canada) and NCS246 (National Center for Streptococcus, 
Provincial Laboratory of Public Health for Northern Alberta, 
Edmonton, Canada) were respectively serotyped as type Ia/c 

35 and type II/R. 
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To increase their virulence, the GBS strains C388/90 
(serotype Ia/c).and NCS 246 (serotype II/R) were serially 
passaged through mice as described previously (Lancefield et 
al. J Exp Med 142:165-179, 1975). Briefly, the increase of 
5 virulence was monitored using intraperitoneal inoculations 
of serial dilutions of a subculture in Todd-Hewitt broth 
obtained from either the blood or spleen of infected mice. 
After the last passage, infected blood samples were used to 
inoculate Todd-Hewitt broth. After an incubation of 2 hours 

10 at 37°C with 7% C0 2/ glycerol at a final concentration of 
10% (v/v) was added to the culture. The culture was then 
aliquoted and stored at -80° C for use in GBS challenge 
experiments. The number of cfu of GBS present in these 
frozen samples was determined. The bacterial concentration 

15 necessary to kill 100% (LD100) of the 18 weeks old mice were 
determined to be 3.5X10 5 and 1.1X10 5 respectively for GBS 
strain C388/90 and NCS246, which corresponded to a 
significant increase in virulence for both strains. Indeed, 
the LD100 recorded before the passages for these two strains 

20 was higher than 10 9 cfu. 

In a bacterial challenge, a freshly thawed aliquot of a 
virulent GBS strain was adjusted to the appropriate 
bacterial concentration using Todd-Hewitt broth and 1ml was 

25 injected intraperitoneally to each female CD-I mouse. The 
mice used for the passive protection experiments were 6 to 8 
weeks old, while the ones used for the active protection 
experiments were approximately 18 weeks old at the time of 
the challenge. All inocula were verified by colony counts. 

30 Animals were observed for any sign of infection four times 
daily for the first 48 h after challenge and then daily for 
the next 12 days. At the end of that period, blood samples 
were obtained from the survivors and frozen at -20°C. The 
spleen obtained from each mouse that survived the challenge 

35 was cultured in order to identify any remaining GBS. 
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EXAMPLE 2 Immunization and protection in mice with 
formaldehyde killed whole GBS cells 

5 Formaldehyde killed GBS whole cells were prepared according 
to the procedures described in Lancefield et al (J Exp Med 
142:165-179,1975). Briefly, an overnight culture on sheep 
blood agar plates (Quelab Laboratories, Montreal, Canada) of 
a GBS strain was washed twice in PBS buffer (phosphate 
10 buffered-saline, pH7.2), adjusted to approximately 3X10 9 

cfu/mL and incubated overnight in PBS containing 0.3% (v/v) 
formaldehyde. The killed GBS suspension was washed with PBS 
and kept frozen at -80°C. 

15 Female CD-I mice, 6 to 8 weeks old (Charles River, St- 
Constant, Quebec, Canada), were injected subcutaneously 
three times at two weeks interval with 0.1 ml of 
formaldehyde killed cells of GBS strain C388/90 (~6X10 7 GBS) , 
or 0..1 ml of PBS for the control group. On the day before 

20 the immunization, Alhydrogel™ (Superfos Biosector, 

Frederikssund, Denmark) at a final concentration of 0.14 mg 
or 0.21 mg of Al, was added to these preparations and 
incubated overnight at 4°C with agitation. Serum samples 
were obtained from each mouse before the beginning of the 

25 immunization protocol and two weeks after the last ■ 
injection. The sera were frozen at -20°C. 

Eight mice in each control group injected with PBS and the 
group immunized with formaldehyde killed whole cells GBS 

30 strain C388/90 (Ia/c) were challenged with 1.5X10 4 cfu of 
GBS strain C388/90 (Ia/c) one week after the third 
injection. All mice immunized with the formaldehyde killed 
GBS whole cells survived the homologous challenge while, 
within 5 days after the challenge, only 4 out of the 8 mice 

35 injected with PBS survived from the infection. In order to 
increase the mortality rate in the control groups, the 
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bacterial suspension had to be adjusted according to the ag< 
of the mice at the time of the bacterial challenge. In 
subsequent challenge experiments, when mice were older than 
15 weeks, the bacterial inoculum was increased to 
5 concentrations between 3.0X10 5 and 2.5X10 6 cfu. 
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Table 1 Immunization of CD1 mice with formaldehyde 
killed whole cells of GBS and subsequent 
homologous challenge {strain C388/90 (Ia/c)] 
and heterologous challenge [strain NCS246 
5 (II/R)]. 



antigenic preparations 
used for immunization 1 


number of living mice 14 days after the bacterial 
challenge 
(% Survival) 


homologous challenge: 
strain C388/90 (la/c) 


heterologous challenge: 
strain NCS246 (ll/R) 


1st infection 


formaldehyde killed cells 
of GBS strain C388/90 
(la/c) 2 


8/8 (100) 3 


n.d. 5 


control PBS 


4/8 (50) 


n.d. 


2nd infection 


formaldehyde killed cells 
of GBS strain C388/90 
(la/c) 


6/6 (100) 4 


0/6 (0) 6 


control PBS 


2/6 (33) 


0/6 (0) 



1 alhydrogel™ at a final concentration of 0.14 mg or 0.21 mg of Al was used; 

2 approximately 6X10 7 cfu; 

3 intraperitoneal challenge with 1 ml_ Todd-Hewitt culture medium containing GBS C388/90 
1 0 ^ (la/c) suspension adjusted to 1 .5X1 0 4 cfu; 

4 intraperitoneal challenge with 1 mL Todd-Hewitt culture medium containing GBS C388/90 

(!a/c) suspension adjusted to 2.1X10 6 cfu; 

5 not done; 

6 intraperitoneal challenge with 1 mL Todd-Hewitt culture medium containing GBS NCS246 
15 (ll/R) suspension adjusted to 1 .2X10 5 cfu. 

In another experiment, one group of 12 mice corresponding 
to a control group was injected with PBS, while a second 
group of 12 mice was immunized with formaldehyde killed 

20 whole cells of GBS strain C388/90 (Ia/c) . Six mice from 
each of these two groups were challenged with 2.1X10 6 cfu 
of the GBS strain C388/90 (Ia/c) (Table I). As the first 
challenge experiment, all mice immunized with the GBS 
strain C388/90 (Ia/c) survived the homologous challenge. 

25 Only two out of the 6 mice injected with PBS survived the 
infection. 
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The remaining 6 mice in both groups were then used one 
week later to verify whether this antigenic preparation 
could confer cross protection against strain NCS246 (Il/R) 
5 which produce a serologically distinct capsule. None of 
the mice infected with this second GBS strain survived the 
infection. The later result suggested that most of the 
protective immune response induced by formaldehyde killed 
strain C388/90 is directed against the capsular 
10 polysaccharide and that it could be restricted to strains 
of that particular serotype. These results clearly 
indicated that this particular model of infection can be 
efficiently used to study the protection conferred by 
vaccination. 

15 

EXAMPLE 3 Immunization of rabbit with formaldehyde killed 
whole GBS cells and passive protection in mice 

A New Zealand rabbit (2.5 kg, Charles River, St-Constant, 
20 Quebec, Canada) was immunized with formaldehyde killed 
cells of GBS strain C388/90 (Ia/c) to obtain hyperimmune 
serum. This rabbit was injected subcutaneously three 
times at three weeks interval with approximately 1.5X10 9 
cfu of formaldehyde killed whole cells of GBS strain 
25 C388/90 (Ia/c). Freund's complete adjuvant (Gibco BRL 

Life Technologies, Grand Island, New York) was used as the 
adjuvant for the first immunization, while Freund's 
incomplete adjuvant (Gibco BRL) was used for the following 
two injections. Serum samples were obtained before the 
30 beginning of the immunization protocol and two weeks after 
the last injection. The sera were frozen at -20°C. 

The ability of this particular rabbit hyperimmune serum to 
passively protect mice against a lethal infection with GBS 
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was also evaluated. Intraperitoneal injection of mice with 
either 15 or 25 fih of hyperimmune rabbit serum 18 hours 
before the challenge protected 4 out of 5 mice (80%) against 
the infection. Comparatively, survival rates lower than 20% 
were recorded for mice in the control group injected with 
PBS or serum obtained from a rabbit immunized with 
meningococcal outer membrane preparation. This result 
clearly indicates that the immunization of another animal 
species with killed GBS cells can induce the production of 
antibodies that can passively protect mice. This reagent 
will also be used to characterize clones. 

Table 2 Passive protection of CD-I mice conferred by 
rabbit serum obtained after immunization with 
formaldehyde killed group B whole streptococci 
(strain C388/90 (Ia/c)) antigenic preparation 



groups 


number of living mice 14 
days after the bacterial 
challenge with GBS strain 
C388/90 (Ia/c) 2 


% 

survival 


rabbit hyperimmune 
serum 2 
- 25 Ml 


4/5 


80 


rabbit hyperimmune 
serum 1 
- 15 Ml 


4/5 


80 


control rabbit serum 
- 25 Ml 


1/5. 


20 


control PBS 


1/10 


10 



1 Freund's complete adjuvant was used for first 
immunization, and Freund's incomplete adjuvant for the 
following two injections; 

2 intraperitoneal challenge with 1 ml Todd-Hewitt culture 
medium containing GBS C388/90 (Ia/c) suspension adjusted 
to 2X10 4 cfu. 
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EXAMPLE 4 Recombinant production of His . Tag-GBS fusion 
protein 

The coding region of a GBS gene was amplified by PCR (DNA 
5 Thermal Cycler GeneAmp PCR system 2400 Perkin Elmer, San 
Jose, CA) from the genomic DNA of GBS strain C388/90 (Ia/c) 
using the oligos that contained base extensions for the 
addition of the restriction sites Bglll (AGATCT) and Hindi I I 
(AAGCTT) , respectively. The PCR product was purified from 

10 agarose gel using a Qiaex II gel extraction kit from Qiagen 
(Chatsworth, CA) , digested with the restriction enzymes 
Bglll and Hindlll (Pharmacia Canada Inc Baie d'Urfe, 
Canada), and extracted with phenol : chloroform before ethanol 
precipitation. The pET-32b(+) vector (Novagen, Madison, WI) 

15 containing the thioredoxin-His .Tag sequence was- digested 
with the restriction enzymes Bglll and Hindlll, extracted 
with phenol: chloroform, and then ethanol precipitated. The 
Bglll -Hindlll genomic DNA fragment was ligated to the Bglll- 
Hindlll pET-32b(+) vector to create the coding sequence for 

20 thioredoxin-His. Tag-GBS fusion protein whose gene was under 
control of the T7 promoter. The ligated products were 
transformed into E. coli strain XLI Blue MRF ' (A (mcrA) 183 A 
(/ncrCB-hsdSMR-jnrr) 173 endAl supE44 thi-l recAl gyrA96 relAl 
lac [F 1 proAB lad q ZAM15TnlO (Tet r )] c )' {Stratagene, La Jolla, 

25 CA) according to the method of Simanis (Hanahan, D. DNA 
Cloning, 1985, D.M. Glover (ed.), pp. 109-135). The 
recombinant pET plasmid was purified using a Qiagen kit 
(Qiagen, Chatsworth, CA) and the nucleotide sequence of the 
DNA insert was verified by DNA sequencing (Taq Dye Deoxy 

30 Terminator Cycle Sequencing kit, ABI, Foster City, CA) . The 
recombinant pET plasmid was transformed by electroporation 
(Gene Pulser II apparatus, BIO-RAD Labs, Mississauga, 
Canada) into E. coli strain AD494 (DE3) (Aara~2eu7697 
AlacX74 AphoA PvuII phoR A/na2F3 F ! [lac* (2acl q ) pro] 

35 trxB::Kan (DE3) ) (Novagen, Madison, WI) . In this strain of 
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E. coli, the T7 promoter controlling expression of the 
fusion protein, is specifically recognized by the T7 RNA 
polymerase (present on the XDE3 prophage) whose gene is 
under the control of the lac promoter which is inducible by 
5 isopropyl-p-D-thio-galactopyranoside (IPTG) . 

The transformant AD494 (DE3 ) /rpET was grown at 37°C with 
agitation at 250 rpm in LB broth (peptone lOg/L, Yeast 
extract 5g/L, NaCl lOg/L) containing lOOpg of ampicillin 

10 (Sigma-Aldrich Canada Ltd., Oakville, Canada) per mL until 
the Ag 00 reached a value of 0.6. In order to induce the 
production of the thioredoxin-His .Tag-GBS fusion protein, 
the cells were incubated for 2 additional hours in the 
presence of IPTG at a final concentration of ImM. The 

15 bacterial cells were harvested by centrifugation. 

The recombinant fusion protein produced by AD494 (DE3) /rpET32 
upon IPTG induction for 2h was partially obtained as 
insoluble inclusion bodies which were purified from 

20 endogenous E. coli proteins by the isolation of insoluble 
aggregates (Gerlach, G.F. et al 1992, Infect. Immun. 
60:892) . Induced cells from a 500 mL culture were 
resuspended in 20 mL of 25% sucrose-50mM Tris-HCl buffer 
(pH8.0) and frozen at -70°C. Lysis of cells in thawed 

25 suspension was achieved by the addition of 5mL of a solution 
of lysozyme (lOmg/mL) in 250mM Tris-HCl buffer (pH8.0) 
followed by an incubation of 10 to 15 min on ice, and the 
addition of 150mL of detergent mix (5 parts of 20mM Tris-HCl 
buffer [pH7.4] -300mM NaCl-2% deoxycholic acid-2% Nonidet P- 

30 40 and 4 parts of lOOmM Tris-HCl buffer [pH8]-50mM EDTA-2% 
Triton X-100) followed by 5 min incubation on ice. Upon 
sonication, protein aggregates were harvested by 
centrifugation for 30 min at 35,000 X g and a sample of the 
soluble cellular fraction was kept. The aggregated proteins 

35 were solubilized in 6M guanidine hydrochloride. The 
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presence of the fusion protein in both the soluble and 
insoluble fractions was shown by Western Blot analysis using 
the serum of a mouse injected with formaldehyde killed cells 
of GBS strain C388/90 (Ia/c) that survived a bacterial 
5 challenge with the corresponding GBS strain. 

The purification of the fusion protein from the soluble 
fraction of IPTG-induced AD494 (DE3) /rpET was done by 
affinity chromatography based on the properties of the 

10 His. Tag sequence (6 consecutive histidine residues) to bind 
to divalent cations (Ni 2+ ) immobilized on the His. Bind metal 
chelation resin (Novagen, Madison, WI) . The purification 
method used are those described in the pET system Manual, 
6th Edition (Novagen, Madison, WI) . Briefly, the pelleted 

15 cells obtained from a lOOmL culture induced with IPTG was 
resuspended in 4mL of Binding buffer (5mM imidazole-SOOmM 
NaCl-20mM Tris-HCl pH7.9), sonicated, and spun at 39,000 X g 
for 20 min to remove debris. The supernatant was filtered 
(0.45nm pore size membrane) and deposited on a column of 

20 His. Bind resin equilibrated in Binding buffer. The column 
was then washed with 10 column volumes of Binding buffer 
followed by 6 column volumes of Wash buffer (20mM imidazole- 
SOOmM NaCl-20mM Tris-HCl pH7.9). The thioredoxin-His .Tag- 
GBS fusion protein was eluted with Elute buffer (1M 

25 imidazole-500mM NaCl-20mM Tris-HCl pH7.9). The removal of 
the salt and imidazole from the sample was done by dialysis 
against 3X1 liter PBS at 4°C. 

The quantities of fusion protein obtained from either the 
30 soluble or insoluble cytoplasmic fractions of E. coli were 
estimated by Coomassie staining of a sodium dodecyl sulfate 
(SDS) -polyacrylamide gel with serial dilutions of these 
proteins and a bovine serum albumin standard (Pierce 
Chemical Co. Rockford, 111.). 

35 
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EXAMPLE 5 Recombinant production of GBS protein under 

control of lambda P L promoter 

The DNA coding region of a GBS protein was inserted 
5 downstream of the promoter A.P L into the translation vector 
pURV22. This plasmid was derived from p629 (George et al, 
1987, Bio/Technology 5:600) from which the coding region for 
a portion of the herpes simplex virus type I (HSV-I) 
glycoprotein (gD-1) was removed and the ampicillin 

10 resistance gene replaced by a kanamycin cassette obtained 
from the plasmid vector pUC4K (Pharmacia Biotech Canada 
Inc., Baie D'Urfe, Canada). The vector contained a cassette 
of the bacteriophage X cI857 temperature sensitive repressor 
gene from which the functional P R promoter had been deleted. 

15 The inactivation of the cI857 repressor by temperature 

increase from the ranges of 30-37°C to 37-42°C resulted in 
the induction of the gene under the control of X P L . The 
translation of the gene was controlled by the ribosome 
binding site cro followed downstream by a Bglll restriction 

20 site (AGATCT) and the ATG : ACTAAGGAGGTTAGATCTATG . 

Restriction enzymes and T4 DNA ligase were used according to 
suppliers (Pharmacia Biotech Canada Inc., Baie D'Urfe, 
Canada; and New England Biolabs Ltd., Mississauga, Canada). 

25 Agarose gel electrophoresis of DNA fragments was performed 
as described by Sambrook et al . ( Molecular cloning : A 
laboratory Manual, 1989, Cold Spring Harbor Laboratory 
Press, N.Y) . Chromosomal DNA of the GBS bacteria was 
prepared according to procedures described in Jayarao et al 

30 (J. Clin. Microbiol., 1991, 29:2774). DNA amplification 

reactions by polymerase chain reaction (PCR) were made using 
DNA Thermal Cycler GeneAmp PCR system 2400 (Perkin Elmer, 
San Jose, CA) . Plasmids used for DNA sequencing were 
purified using plasmid kits from Qiagen (Chatsworth, CA) . 

35 DNA fragments were purified from agarose gels using Qiaex II 
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gel extraction kits from Qiagen (Chatsworth, CA) . Plasmid 
transformations were carried out by the method described by 
Hanahan ( DNA Cloning . Glover (ed.) pp, 109-135, 1985). The 
sequencing of genomic DNA inserts in plasmids was done using 
5 synthetic oligonucleotides which were synthesized by 
oligonucleotide synthesizer model 394 (the Perkin-Elmer 
Corp., Applied Biosystems Div. (ABI) , Foster City, CA) . The 
sequencing reactions were carried out by PCR using the Taq 
Dye Deoxy Terminator Cycle Sequencing kit (ABI, Foster City, 

10 CA) and DNA electrophoresis was performed on automated DNA 
sequencer 373A (ABI, Foster City, CA) . The assembly of the 
DNA sequence was performed using the program Sequencer 3.0 
(Gene Codes Corporation, Ann Arbor, MI) . Analysis of the DNA 
sequences and their predicted polypeptides was performed 

15 with the program Gene Works version 2.45 (Intelligenetics, 
Inc . , Mountain View CA) . 

The coding region of the GBS gene was amplified by PCR from 
GBS strain C388/90 (Ia/c) genomic DNA using oligos that 

20 contained base extensions for the addition of restriction 

sites Bglll (AGATCT) and Xbal (TCTAGA) , respectively. The PCR 
product was purified from agarose gel using a Qiaex II gel 
extraction kit from Qiagen (Chatsworth, CA) , digested with 
the restriction enzymes Bglll and Xbal, and extracted with 

25 phenol: chloroform before ethanol precipitation. The pURV22 
vector was digested with the restriction enzymes Bglll and 
Xbal, extracted with phenol : chloroform, and ethanol 
precipitated. The Bglll -Xbal genomic DNA fragment was 
ligated to the Bglll -Xbal pURV22 vector in which the GBS 

30 gene was under the control of the XPL promoter. The ligated 
products were transformed into E. coli strain XLI Blue MRF 1 
(A (mcrA) 183A (rocrCB-hsdSMR-mrr) 173 endAl supE44 thi-1 recAl 
gyrA96 relAl lac[F» proAB lacl q ZAM15 TnlO <Tet r ) ] c ) 
(Stratagene, La Jolla CA) according to the methods described 

35 in Hanahan, supra . Transf ormants harboring plasmids with the 
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insert were identified by analysis of lysed cells submitted 
to electrophoresis on agarose gel (Sambrook et al, supra ) . 
The recombinant pURV22 plasmid was purified using a Qiagen 
kit (Qiagen, Chatsworth, CA) and the nucleotide sequence of 
5 the DNA insert was verified by DNA sequencing. 

The transformant XL I Blue MRF f /rpURV22 was grown at 34 °C 
with agitation at 250 rpm in LB broth containing S0\xg of 
kanamycin per mL until the Ag 00 reached a value of 0.6. In 
10 order to induce the production of the fusion protein, the 
cells were incubated for 4 additional hours at 39°C. The 
bacterial cells were harvested by centrif ugation , 
resuspended in sample buffer, boiled for 10 min and kept at 
-20°C. 

15 

EXAMPLE 6 Subcloning GBS protein gene in CMV plasmid pCMV-GH 

The DNA coding region of a GBS protein was inserted in phase 
20 downstream of the human growth hormone (hGH) gene which was 
under the transcriptional control of the cytomegalovirus 
(CMV) promoter in the plasmid vector pCMV-GH (Tang et al, 
Nature, 1992, 356:152). The CMV promoter is non functional 
in E. coli cells but active upon administration of the 
25 plasmid in eukaryotic cells. The vector also incorporated 
the ampicillin resistance gene. 

The coding region of the gene was amplified by PCR from 
genomic DNA of GBS strain C388/90 (Ia/c) using the oligos 

30 that contained base extensions for the addition of the 

restriction sites Bglll (AGATCT) and Hindlll (AAGCTT) . The 
PCR product was purified from agarose gel using a Qiaex II 
gel extraction kit from Qiagen (Chatsworth, CA) , digested 
with the restriction enzymes Bglll and Hindlll, and 

35 extracted with phenol : chloroform before ethanol 

precipitation. The pCMV-GH vector (Laboratory of Dr. Stephen 
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A. Johnston, Department of Biochemistry, The University of 
Texas, Dallas, Texas) containing the human growth hormone to 
create fusion proteins was digested with the restriction 
enzymes BamHI and Hindi II, extracted with phenol : chloroform, 
5 and ethanol precipitated. The 1.3-kb Bglll-Hindlll genomic 
DNA fragment was ligated to the BamHI -Hindi I I pCMV-GH 
vector to create the hGH-GBS fusion protein under the 
control of the CMV promoter. The ligated products were 
transformed into E. coli strain DH5a[<|>80 2acZ AM15 endAl 

10 recAl hsdRll ( r K" ra lT) supE44 fchi-lA/ gyrASG relAl A(2acZYA- 
argF)U169] (Gibco BRL, Gaithersburg, MD) according to the 
methods described by Hanahan, supra . Transf ormants 
harboring plasmids with the insert were identified by 
analysis of lysed cells submitted to electrophoresis on 

15 agarose gel (Sambrook, J. et al , supra ) . The recombinant 
pCMV plasmid was purified using a Qiagen kit (Qiagen, 
Chatsworth, CA) and the nucleotide sequence of the DNA 
insert was verified by DNA sequencing. 

20 

EXAMPLE 7 Immunological activity of GBS protein to GBS 
challenge 

Four groups of 12 female CD-I mice (Charles River, St- 
25 Constant, Quebec, Canada) of 6 to 8 weeks were injected 
subcutaneously three times at three week intervals with 
O.lmL of the following antigenic preparations: formaldehyde 
killed cells of GBS strain C388/90 (~6X10 7 cfu) , 20fig of 
thioredoxin-His.Tag-GBS fusion protein obtained from the 
30 insoluble (inclusion bodies) or 20^g of the fusion protein, 
affinity purified (nickel column) , from the soluble 
cytoplasmic fraction in E.coli, or 20fig of affinity purified 
(nickel column) thioredoxin-His .Tag control polypeptide. 
20^ig of QuilA™ (Cedar lane Laboratories Ltd, Hornby, Canada) 
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was added to each antigenic preparation as the adjuvant. 
Serum samples were obtained from each mouse before 
immunization (PB) and on days 20 (TBI), 41 (TB2) and 54 
(TB3) during the immunization protocols. Sera were frozen 
5 at -20°C. 

An increase of the ELISA titers was recorded after each 
injection of the fusion protein indicating a good primary- 
response and a boost of the specific humoral immune response 

10 after each of the second and third administration. At the 
end of the immunization period, the means of reciprocal 
ELISA titers was 456,145 for the group immunized with 20^g 
of fusion protein obtained from inclusion bodies compared to 
290,133 for the group of mice immunized with the protein 

15 from soluble fraction in E.coli. The latter result suggests 
that the protein obtained from inclusion bodies could be 
more immunogenic than the soluble protein. Analysis of mice 
sera in ELISA using the affinity purified thioredoxin- 
His.Tag to coat plates showed that negligible antibody 

20 titers are made against the thioredoxin-His .Tag portion of 
the fusion protein. The reactivity of the sera from mice 
injected with the recombinant fusion protein was also tested 
by ELISA against formaldehyde killed whole cells of GBS 
strain C388/90. The antibodies induced by immunization with 

25 recombinant fusion protein also recognized their specific 
epitopes on GBS cells indicating that their conformation is 
close enough to the native streptococcal protein to induce 
cross-reactive antibodies. 

30 To verify whether the immune response induced by 

immunization could protect against GBS infection, mice were 
challenged with 3.5X10 5 cfu of GBS strains C338/90 (Ia/c) and 
1.2X10 5 cfu of strain NCS246UI/R) the results of which are 
illustrated in tables 3 and 4 respectively. Mice immunized 

35 with control thioredoxin-His .Tag peptide were not protected 
against challenge with either GBS strain while those 
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immunized with formaldehyde killed C388/90 whole cells only 
provided protection against homologous challenge. The 
thioredoxin-His.Tag-GBS fusion protein of the invention 
protected mice from challenge with both GBS strains. Blood 
and spleen culture of these mice did not reveal the presence 
of any GBS . 
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Table 3 Survival from GBS strain C388/90 (Ia/c) challenge 1 



: 

immunizing agent 

i 


no . mice 

surviving 

challenge 


| % 
S survival 


thioredoxin-His . Tag 2 


1 / 6 


17 


formaldehyde killed C388/90 
cells 3 | 


6/6 


i 

100 


i 

thioredoxin-His.Tag-GBS fusion j 
(inclusion body preparation) 4 

! 


6/6 


j 

100 

1 , — 


thioredoxin-His.Tag-GBS fusion j 
(cytoplasmic fraction) 4 


6/6 


: 
! 

| 100 



1 intraperitoneal administration with 1 ml Todd-Hewitt 
5 culture medium adjusted to 3.5X10 5 cfu; 

2 20|ig administered; posterior legs paralyzed in surviving 
mouse; GBS detected in blood and spleen; 

3 6X10 7 cfu administered; 

4 20|ig administered. 
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Table 4 Survival from GBS strain 


NCS246 (II/R) 


challenge 1 


1 

immunizing agent 1 

1 


1 
i 

no. mice j 
surviving 
challenge j 


% survival 


! 

thioredoxin-His . Tag 2 

I 


0/6 ! 

! 


0 


1 

formaldehyde killed C3 88/90 

ucii o : 

! 


2/6 I 

J 


34 


i 

thioredoxin-His. Tag-GBS fusion j 
(inclusion body preparation) 2 \ 

i 


5 / 5 4 | 

i 

: 


100 


thioredoxin-His. Tag-GBS fusion ! 
(cytoplasmic fraction) 2 


~f~ 

6/6 | 


100 



5 intraperitoneal administration with 1 ml Todd-Hewitt 
culture medium containing GBS NCS246(II/R) suspension 
adjusted to 1.2X10 5 cfu. 

2 20jig administered; 

3 6X10 7 cfu administered; 

10 4 one mouse died during immunization. 

EXAMPLE 8 Immunization with recombinant GBS protein confers 
protection against experimental GBS infection 

15 

This example illustrates the protection of mice against 
fatal GBS infection by immunization with the recombinant 
protein corresponding to the SEQ ID NO: 39. 

20 Groups of 10 female CD-I mice (Charles River) were immunized 
subcutaneously three times at three-week intervals with 20 
]ig of recombinant protein purified from E^ coli strain BLR 
(Novagen) harboring the recombinant pURV22 plasmid vector 
containing the GBS gene corresponding to SEQ ID NO: 42 in 

25 presence of 20 pg of QuilA™ adjuvant (Cedarlane 

Laboratories Ltd, Hornby, Canada) or, as control, with 

34 



WO 99/42588 



PCT/CA99/00114 



QuilA™ adjuvant alone in PBS. Blood samples were collected 
from the orbital sinus on day 1, 22 and 43 prior to each 
immunization and fourteen days (day 57) following the third 
injection. One week later the mice were challenged with 
approximately 10 4 to 10 6 CFU of various virulent GBS strains. 

Samples of the GBS challenge inoculum were plated on TSA/5% 
sheep blood agar plates to determine the CFU and to verify 
the challenge dose. Deaths were recorded for a period of 14 
days and on day 14 post-challenge, the surviving mice were 
sacrificed and blood and spleen were tested for the presence 
of GBS organisms. The survival data are shown in table 5. 

Prechallenge sera were analyzed for the presence of 
antibodies reactive with GBS by standard immunoassays. Elisa 
and immunoblot analyses indicated that immunization with 
recombinant GBS protein produced in E. coli elicited 
antibodies reactive with both, recombinant and native GBS 
protein. Antibody responses to GBS are described in Example 
9. 
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Table 5. Ability of recombinant GBS protein corresponding 
to SEQ ID NO: 39 to elicit protection against 8 diverse GBS 
challenge strains 





Challenge strain 




Immunogeri 


Designation 


Type 


No. alive: No. dead 1 


rGBS protein 
none 


C388/90 


Ia/c 


8 : 2 (P<0.0001) 
0 : 10 


rGBS protein 
none 


NCS 246 


II/R 


10 : 0 (P=0.0012) 
3 : 7 


rGBS protein 
none 


ATCC12401 


lb 


10 : 0 (P=0.001) 
3 : 7 


rGBS protein 
none 


NCS 535 


V 


10 : 0 (P=0.01) 
5 : 5 


rGBS protein 
none 


NCS 9842 


VI 


10 : 0 (P<0.0001) 
0 : 10 


rGBS protein 
NCS 915-F 3 
none 


NCS 915 


III 


7 : 3 (P=0.0007) 2 
1:9 

4 : 6 


rGBS protein 
NCS 954-F 
none 


NCS 954 


III/R 


7:3 ( P=0.002) i 
4 : 6 
1 : 9 


rGBS protein 

C0H1-F 

none 


COH1 


III 


4 : 6 (P=0.0004) 
3 : 7 
0 : 10 



Groups of 10 mice per group were used, the number of mice 
surviving to infection and the number of dead mice are 
indicated. The survival curves corresponding to recombinant 
GBS protein-immunized animals were compared to the survival 
curves corresponding to mock-immunized animals using the log- 
rank test for nonparametric analysis. 
2 Comparison analysis to NCS915-F-immunized animals. 

Animals were immunized with formaldehyde-killed GBS in 
presence of QuilA™ adjuvant. 

All hemocultures from surviving mice were negative at day 14 
post-challenge. Spleen cultures from surviving mice were 
negative except for few mice from experiment MB-11. 
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EXAMPLE 9 Vaccination with the recombinant GBS protein 
elicits an immune response to GBS 

Groups of 10 female CD-I mice were immunized subcutaneously 
5 with recombinant GBS protein corresponding to SEQ ID NO: 39 
as described in Example 8. In order to assess the antibody 
response to native GBS protein, sera from blood samples 
collected prior each immunization and fourteen days after 
the third immunization were tested for antibody reactive 

10 with GBS cells by ELISA using plates coated with 

formaldehyde-killed GBS cells from type III strain NCS 954, 
type lb strain ATCC12401, type V strain NCS 535 or type VI 
strain NCS 9842. The specificity of the raised antibodies 
for GBS protein was confirmed by Western blot analyses to 

15 GBS cell extracts and purified recombinant antigens. The 

results shown in Figure 10 clearly demonstrate that animals 
respond strongly to recombinant GBS protein used as 
immunogens with median reciprocal antibody titers varying 
between 12000 and 128000, for sera collected after the third 

20 immunization, depending of the coating antigen. All 

preimmune sera were negative when tested at a dilution of 
1 :100. GBS-reactive antibodies were detectable in the sera 
of each animal after a single injection of recombinant GBS 
protein. 

25 
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Example 10 Antigenic conservation of the GBS protein of the 
present invention 

Monoclonal antibodies (MAbs) specific to the GBS protein of 
5 the present invention were used to demonstrate that this 
surface antigen is produced by all GBS and that it is also 
antigenically highly conserved. 

A collection of 68 GBS isolates was used to evaluate the 

10 reactivity of the GBS-specific MAbs. These strains were 
obtained from the National Center for Streptococcus, 
Provincial Laboratory of Public Health for Northern Alberta, 
Canada; Centre Hospital ier Universitaire de Quebec, Pavilion 
CHUL, Quebec, Canada; American Type Culture Collection, USA; 

15 Laboratoire de Sante Publique du Quebec, Canada; and Dept. 
of Infectious Disease, Children's Hospital and Medical 
Center, Seattle, USA. All eight Mabs were tested against 
the following panel of strains: 6 isolates of serotype la or 
Ia/c, 3 isolates of serotype lb, 4 isolates of serotype II, 

20 14 isolates of serotype III, 2 isolates of serotype IV, 2 
isolates of serotype V, 2 isolates of serotype VI, 2 
isolates of serotype VII, l isolate of serotype VIII, 10 
isolates that were not serotyped and 3 bovine S. agalactiae 
strains. MAb 3A2 was also reacted with additional GBS: 9 

25 isolates of serotype Ia/c and 10 isolates of serotype V. 
The strains were grown overnight on blood agar plates at 
37°C in an atmosphere of 5% C0 2 . Cultures were stored at - 

70°C in heart infusion broth with 20% (v/v) glycerol. 

30 To obtain the GBS protein-specific MAbs, mice were immunized 
three times at three-week intervals with 20 fig of purified 
recombinant GBS protein. (SEQ ID NO :44) in the presence of 
20% QuilA™ adjuvant. Hybridoma cell lines were generated by 
fusion of spleen cells recovered from immunized mice with 

35 the nonsecreting SP2/0 myeloma cell line as described 
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previously (Hamel, J. et al . 1987. J. Med. Microbiol. 
23:163-170). Hybrid clone supernatants were tested for 
specific antibody production by EL ISA using formaldehyde 
inactivated GBS and purified recombinant GBS protein (SEQ ID 
5 NO :39 or 44) as coating antigen, as previously described 
(Hamel/ J. et al . 1987. J. Med. Microbiol. 23:163-170). 
Specific hybrid were cloned by limiting dilutions, expanded, 
and frozen in liquid nitrogen. Production of recombinant GBS 
protein was presented in Examples 4 & 5. Purified 

10 recombinant GBS protein or formaldehyde inactivated GBS were 
resolved by electrophoresis by using the discontinuous 
buffer system of Laemmli as recommended by the manufacturer 
and then transfer onto nitrocellulose membrane for Western 
immunoblotting as described previously (Martin et al. 1992. 

15 Infect. Immun. 60:2718-2725). 

Western immunoblotting experiments clearly indicated that 
all eight MAbs recognized a protein band that corresponded 
to the purified recombinant GBS protein (SEQ ID NO :39) . 

20 These MAbs also reacted with a protein band present in every 
GBS isolates tested so far. The reactivity of these GBS- 
specific MAbs are presented in Table 6. Each MAb reacted 
well with all 46 GBS. In addition, these MAbs also 
recognized the 3 S. agalactiae strains of bovine origin that 

25 were tested. MAb 3A2 also recognized nineteen GBS; 9 

isolates of serotype Ia/c and 10 of serotype V. The other 
MAbs were not tested against these additional strains. 

These results demonstrated that the GBS protein (SEQ ID 
30 NO :39) was produced by all the 65 GBS and the three 3 S. 

agalactiae strains of bovine origin that were tested so far. 
More importantly, these results clearly demonstrated that 

the epitopes recognized by these eight GBS-specific MAbs 

were widely distributed and conserved among GBS. These 
35 results also indicated that these epitopes were not 
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restricted to serologically related isolates since 
representatives of all known GBS serotypes including the 
major disease causing groups were tested. 

In conclusion, the data presented in this example clearly 
demonstrated that the GBS protein of the present invention 
is produced by all GBS and that it is antigenically highly 
conserved. 
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WE CLAIM: 

1. An isolated polynucleotide encoding a polypeptide 
having at least 70% identity to a second polypeptide 
having a sequence selected from the group consisting 
of: 

SEQ ID NO: 2, SEQ ID NO: 3, SEQ ID NO: 4, SEQ ID NO: 5, 
SEQ ID NO: 6, SEQ ID NO: 8, SEQ ID NO: 9, SEQ ID NO:10, 
SEQ ID NO:ll, SEQ ID NO:12, SEQ ID NO:14, SEQ ID NO:15, 
SEQ ID NO:16, SEQ ID NO:17, SEQ ID NO:18, SEQ ID NO:19, 
SEQ ID NO:20, SEQ ID NO:21, SEQ ID NO:23, SEQ ID NO:24, 
SEQ ID NO: 25, SEQ ID NO: 26, SEQ ID NO: 28, SEQ ID NO: 29, 
SEQ ID NO:30, SEQ ID NO:31, SEQ ID NO:33, SEQ ID NO:34, 
SEQ ID NO:35, SEQ ID NO:36, SEQ ID NO:38, SEQ ID NO:39, 
SEQ ID NO: 40, SEQ ID NO: 41 and SEQ ID NO: 44 or 
fragments, analogs or derivatives thereof . 

2. A polynucleotide according to claim 1, wherein said 
polynucleotide encodes a polypeptide having at least 
95% identity to the second polypeptide. 

3. An isolated polynucleotide encoding a polypeptide 
capable of generating antibodies having binding 
specificity for a polypeptide having a sequence 
selected from the group consisting of : 

SEQ ID NO: 2, SEQ ID NO: 3, SEQ ID NO: 4, SEQ ID NO: 5, 
SEQ ID NO: 6, SEQ ID NO: 8, SEQ ID NO: 9, SEQ ID NO:10, 
SEQ ID NO:ll, SEQ ID NO:12, SEQ ID NO:14, SEQ ID NO:15, 
SEQ ID NO: 16, SEQ ID NO: 17, SEQ ID NO: 18, SEQ ID NO: 19, 
SEQ ID NO:20, SEQ ID NO:21, SEQ ID NO:23, SEQ ID NO:24, 
SEQ ID NO: 25, SEQ ID NO: 26, SEQ ID NO: 28, SEQ ID NO: 29, 
SEQ ID NO:30, SEQ ID NO:31, SEQ ID NO:33, SEQ ID NO:34, 
SEQ ID NO:35, SEQ ID NO:36, SEQ ID NO:38, SEQ ID NO:39, 
SEQ ID NO: 40, SEQ ID NO: 41 and SEQ ID NO: 44 or 
fragments, analogs or derivatives thereof. 
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4. An isolated polynucleotide that is complementary to the 
polynucleotide of claim 1. 

5. An isolated polynucleotide that is complementary to the 
polynucleotide of claim 3. 

6. The polynucleotide of claim 1, wherein said 
polynucleotide is DNA. 

7. The polynucleotide of claim 3, wherein said 
polynucleotide is DNA. 

8. The polynucleotide of claim 1, wherein said 
polynucleotide is RNA. 

9. The polynucleotide of claim 3, wherein said 
polynucleotide is RNA. 

10. A polynucleotide which hybridizes under stringent 
conditions to a second polynucleotide having a sequence 
selected from the group consisting of : 

SEQ ID NO : 1, SEQ ID NO : 7, SEQ ID NO : 13, SEQ ID 
NO : 22,. SEQ ID NO : 27, SEQ ID NO : 32, SEQ ID NO : 
37, SEQ ID NO : 42 and SEQ ID NO : 43 or fragments, 
analogues or derivatives thereof. 

11. A polynucleotide which hybridizes under stringent 
conditions to a second polynucleotide having a sequence 
selected from the group consisting of : 

SEQ ID NO : 37, SEQ ID NO : 42 and SEQ ID NO : 43 . 

12. A polynucleotide according to claim 11 which hybridizes 
under stringent conditions to a second polynucleotide 
having the sequence SEQ ID NO : 37. 
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13 . A polynucleotide according to claim 11 which hybridizes 
under stringent conditions to a second polynucleotide 
having the sequence SEQ ID NO : 42. 

14. A polynucleotide according to claim 11 which hybridizes 
under stringent conditions to a second polynucleotide 
having the sequence SEQ ID NO : 43. 

15. A polynucleotide according to claim 10 wherein said 
polynucleotide has at least 95% complementarity to the 
second polynucleotide. 

16. A polynucleotide according to claim 11 wherein said 
polynucleotide has at least 95% complementarity to the 
second polynucleotide. 

17. A vector comprising the polynucleotide of claim 1, 
wherein said polynucleotide is operably linked to an 
expression control region. 

18. A vector comprising the polynucleotide of claim 3, 
wherein said polynucleotide is operably linked to an 
expression control region. 

19. A host cell transfected with the vector of claim 17. 

20. A host cell transfected with the vector of claim 18. 

21. A process for producing a polypeptide comprising 
culturing a host cell according to claim 19 under 
conditions suitable for expression of said polypeptide. 

22. A process for producing a polypeptide comprising 
culturing a host cell according to claim 20 under 
condition suitable for expression of said polypeptide. 
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23. An isolated polypeptide having at least 70% identity to 
a second polypeptide having a sequence selected from 
the group consisting of: 

SEQ ID NO: 2, SEQ ID NO: 3, SEQ ID NO: 4, SEQ ID NO: 5, 
SEQ ID NO: 6, SEQ ID NO: 8, SEQ ID NO: 9, SEQ ID NO:10, 
SEQ ID NO:ll, SEQ ID NO:12, SEQ ID NO:14, SEQ ID NO: 15, 
SEQ ID NO:16 # SEQ ID NO:17, SEQ ID NO:18, SEQ ID NO:19, 
SEQ ID NO: 20, SEQ ID NO: 21, SEQ ID NO: 23, SEQ ID NO: 24, 
SEQ ID NO:25, SEQ ID NO:26, SEQ ID NO:28, SEQ ID NO:29, 
SEQ ID NO:30, SEQ ID NO:31, SEQ ID NO:33, SEQ ID NO:34, 
SEQ ID NO:35, SEQ ID NO:36, SEQ ID NO:38, SEQ ID NO:39, 
SEQ ID NO: 40, SEQ ID NO: 41 and SEQ ID NO: 44 or 
fragments, analogs or derivatives thereof. 

24. The isolated polypeptide of claim 23 having a sequence 
according to SEQ ID NO : 39. 

25. The isolated polypeptide of claim 23 having a sequence 
according to SEQ ID NO : 44. 

26. An isolated polypeptide capable of generating 
antibodies having binding specificity for a second 
polypeptide having a sequence selected from the group 
consisting of : 

SEQ ID NO: 2, SEQ ID NO: 3, SEQ ID NO: 4, SEQ ID NO: 5, 
SEQ ID NO: 6, SEQ ID NO: 8, SEQ ID NO: 9, SEQ ID NO:10, 
SEQ ID NO:ll, SEQ ID NO:12, SEQ ID NO:14, SEQ ID NO:15, 
SEQ ID NO:16, SEQ ID NO:17, SEQ ID NO:18, SEQ ID NO:19, 
SEQ ID NO:20, SEQ ID NO:21, SEQ ID NO:23, SEQ ID NO:24, 
SEQ ID NO:25, SEQ ID NO:26, SEQ ID NO:28, SEQ ID NO:29, 
SEQ ID NO:30, SEQ ID NO:31, SEQ ID NO:33, SEQ ID NO:34, 
SEQ ID NO:35, SEQ ID NO:36, SEQ ID NO:38, SEQ ID NO:39, 
SEQ ID NO: 40, SEQ ID NO: 41 and SEQ ID NO: 44 or 
fragments, analogs or derivatives thereof. 
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27. The isolated polypeptide of claim 26 having a sequence 
according to SEQ ID NO : 39. 

28. The isolated polypeptide of claim 26 having a sequence 
according to SEQ ID NO : 44 . 

29. An isolated polypeptide having an amino acid sequence 
selected from the group consisting of: 

SEQ ID NO: 2, SEQ ID NO: 3, SEQ ID NO: 4, SEQ ID NO: 5, 
SEQ ID NO: 6, SEQ ID NO: 8, SEQ ID NO: 9, SEQ ID NO: 10, 
SEQ ID NO:ll, SEQ ID NO:12, SEQ ID NO:14, SEQ ID NO:15, 
SEQ ID NO:16, SEQ ID NO:17, SEQ ID NO:18, SEQ ID NO:19, 
SEQ ID NO:20, SEQ ID NO:21, SEQ ID NO:23, SEQ ID NO:24, 
SEQ ID NO: 25, SEQ ID NO: 26, SEQ ID NO: 28, SEQ ID NO: 29, 
SEQ ID NO:30 f SEQ ID NO:31, SEQ ID NO:33, SEQ ID NO:34, 
SEQ ID.NO:35, SEQ ID NO:36, SEQ ID NO:38, SEQ ID NO:39, 
SEQ ID NO: 40 and SEQ ID NO: 41 or fragments, analogs or 
derivatives thereof. 

30. The isolated polypeptide of claim 29 having an amino 
acid sequence according to SEQ ID NO : 39. 

31. An isolated polypeptide having an amino acid sequence 
according to SEQ ID NO : 44 . 

32. An isolated polypeptide according to any one of claims 
29 to 31, wherein the N-terminal Met residue is 
deleted. 

33. An isolated polypeptide according to any one of claims 
29 to 30, wherein the secretory amino acid sequence is 
deleted. 

34. A vaccine composition comprising a polypeptide according 
to any one of claims 23 to 31 and a pharmaceutical^ 
acceptable carrier, diluent or adjuvant. 
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35. A vaccine composition comprising a polypeptide 
according to claim 32 and a pharmaceutical^ acceptable 
carrier , diluent or adjuvant. 

36. A vaccine composition comprising a polypeptide 
according to claim 33 and a pharmaceutically acceptable 
carrier, diluent or adjuvant. 

37. A method for therapeutic or prophylactic treatment of 
streptococcal bacterial infection in an animal 
susceptible to streptococcal infection comprising 
administering to said animal a therapeutic or 
prophylactic amount of a composition according to claim 
34 . 

38. A method for therapeutic or prophylactic treatment of 
streptococcal bacterial infection in an animal 
susceptible to streptococcal infection comprising 
administering to said animal a therapeutic or 
prophylactic amount of a composition according to claim 
35. 

39. A method for therapeutic or prophylactic treatment of 
streptococcal bacterial infection in an animal 
susceptible to streptococcal infection comprising 
administering to said animal a therapeutic or 
prophylactic amount of a composition according to claim 
36. 

40. A method according to any one of claims 37 to 39, 
wherein said animal is a bovine. 

41. A method according to any one of claims 37 to 39, 
wherein said animal is a human. 
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42. A method according to any one of claims 37 to 39, 
wherein said bacterial infection is selected from the 
group consisting of group A streptococcus and group B 
streptococcus . 

43. A method according to claim 42, wherein said bacterial 
infection is group B streptococcus. 



Use of a vaccine composition according to claim 34 for 
the therapeutic or prophylactic treatment of 
streptococcal bacterial infection in an animal 
susceptible to or infected with streptococcal infection 
comprising administering to said animal a therapeutic 
or prophylactic amount of the composition. 



45. Use of a vaccine composition according to any one of 
claims 35 to 36 for the therapeutic or prophylactic 
treatment of streptococcal bacterial infection in an 
animal susceptible to or infected with streptococcal 
infection comprising administering to said animal a 
therapeutic or prophylactic amount of the composition. 



46. Use of a vaccine composition according to any one 

claims 23 to 31 for the manufacture of a vaccine for 
the therapeutic or prophylactic treatment of 
streptococcal bacterial infection in an animal 
susceptible to streptococcal infection comprising 
administering to said animal a therapeutic or 
prophylactic amount of the composition. 

47. Use of a vaccine composition according to claim 32 f 
the manufacture of a vaccine for the therapeutic or 
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prophylactic treatment of streptococcal bacterial 
infection in an animal susceptible to streptococcal 
infection comprising administering to said animal a 
therapeutic or prophylactic amount of the composition. 

48. Use of a vaccine composition according to claim 33 for 
the manufacture of a vaccine for the therapeutic or 
prophylactic treatment of streptococcal bacterial 
infection in an animal susceptible to streptococcal 
infection comprising administering to said animal a 
therapeutic or prophylactic amount of the composition. 
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1 ATCTGGCAA 


AGAGCCAGCT 


AATCGTTTTA 


GTTGGGCTAA 


AAATAAATTA 


TTAATCAATG 


60 


S G K 


EPA 


N R F S 


W A K 


N K L 


LING 




GA1 1 LAJ. 1 GL 


AAL I G 1 AGGA 


GLAAG 1ATCT 


TATTTTTTGC 


AGTTCAATTC 


ATAGGTCTTA 


120 


F I A 


TLA 


A T I L 


F F A 


V Q F 


I G L K 


n a P p n P A T T Zi 


1 uGAAAA 


ACL 1 AG 1 1 TA 


TTATCCTATT 


GACAGCATGG 


ACTTTGATGG 


180 


P D Y 


P G K 


T Y F I 


ILL 


TAW 


T L M A 


PATTAP.TZVZVP 


TPPTTTAPTP 
1 yU 111 AVj 1 0 




Al AGGTATGG 


TTCCTTCTTG 


TCGTTATTAA 


240 


L V T 


A L V 


G W D N 


R Y G 


S F L 


S L L I 


*P TV *P*F TV Tf TV TT 

Inl lrvi inl 1 


GGAGG1 1GG1 


1 GAAGCGCAG 


GAACTTACCC 


AATAGAATTG 


AGTCCTAAGT 


300 


L L F 


Q L G 


S S A G 


T Y P 


I E L 


S P K F 




TCTTTPAAAC 




TTTTTAPPPA 


TPAPTTZiPTP 
1 VjMC 1 1 AG 1 G 


1 G 1 1 1 GAGGA 


TTAAGAGAGA 


360 


F Q T 


I Q P 


F L P M 


T Y S 


V S G 


L R E T 




CCATCTCGTT 


GACGGGAGAC 


GTTAACCATC 


AATGGAGAAT 


GCTAGTAATC 


TTTTTAGTAT 


420 


I S L 


T G D 


V N H Q 


W R M 


L V I 


F L V S 




CATCGATGAT 


ACTTGCTCTT 


CTTATTTATC 


GTAAACAAGA 


AGATTAATAG 


AAAGTATCTA 


480 


SMI 


L A L 


L I Y R 


K Q E 


D 




GTGATAGACT 


AACAGTATGA 


TATGGTATGT 


CAAAGTATTT 


AGGAGGAGAA 


GATATGTCTA 


540 












M S T 
I > 




CTTTAACAAT 


AATTATTGCA 


ACATTAACTG 


CTTTGGAACA 


TTTTTATATT 


ATGTATTTGG 


600 


L T I 


I I A 


T L T A 


L E H 


F Y I 


M Y L E 





AGACGTTAGC CACCCAGTCA AATATGACTG GGAAGATTTT TAGTATGTCT AAAGAAGAGT 660 
TLA TQS NMTG K I F SMS KEEL 



TGTCATATTT ACCCGTTATT AAACTTTTTA AGAATCAAGG TGTATACAAC GGCTTGATTG 720 
SYL P V I KLFK NQG V Y N GLIG 

GCCTATTCCT CCTTTATGGG TTATATATTT CACAGAATCA AGAAATTGTA GCTGTTTTTT 780 
L F L LYG LYIS Q N Q EIV AVFL 

TAATCAATGT ATTGCTAGTT GCTATTTATG GTGCTTTGAC AGTTGATAAA AAAATCTTAT 840 
I N V LLV AIYG ALT V.DK KILL 

TAAAACAGGG TGGTTTACCT ATATTAGCTC TTTTAACATT CTTATTTTAA TACTACTTAG 900 
KQG GLP ILAL LTF LF 

CCGTTCGATT TAGTTGAACG GCTTTTAGTA ATCATTTTTT TCTCATAATA CAGGTAGTTT 960 

AAGTAATTTG TCTTTAAAAA TAGTATAATA TAACTACGAA TTCAAAGAGA GGTGACTTTG 1020 

ATTATGACTG AGAACTGGTT AC AT ACT AAA GATGGTTCAG ATATTTATTA TCGTGTCGTT 1080 
MTE NWL HTK DGSD IYY R V V 
I > 

GGTCAAGGTC AACCGATTGT TTTTTTACAT GGCAATAGCT TAAGTAGTCG CTATTTTGAT 1140 
GQGQ PIV FLH GNSL SSR YFD 

AAGCAAATAG CATATTTTTC TAAGTATTAC CAAGTTATTG TTATGGATAG TAGAGGGCAT 1200 
K Q I A YFS KYY Q V I V MDS RGH 

GGCAAAAGTC ATGCAAAGCT AAATACCATT AGTTTCAGGC AAATAGCAGT TGACTTAAAG 1260 
GKSH AKL N T I SFRQ IAV DLK 
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GATATCTTAG TTCATTTAGA GATTGATAAA GTTATATTGG TAGGCCATAG CGATGGTGCC 1320 
D I L V HLE IDK VILV GHS DGA 

AATTTAGCTT TAGTTTTTCA AACGATGTTT CCAGGTATGG TTAGAGGGCT TTTGCTTAAT 1380 
NLAL VFQ TMF PGMV RGL LLN 

TCAGGGAACC TGACTATTCA TGGTCAGCGA TGGTGGGATA TTCTTTTAGT AAGGATTGCC 1440 
SGNL TIH GQR WWDI LLV R I A 

TATAAATTCC TTCACTATTT AGGGAAACTC TTTCCGTATA TGAGGCAAAA AGCTCAAGTT 1500 
YKFL HYL GKL FPYM RQK A Q V 

ATTTCGCTTA TGTTGGAGGA TTTGAAGATT AGTCCAGCTG ATTTACAGCA TGTGTCAACT 1560 
ISLM LED LKI SPAD LQH VST 

CCTGTAATGG TTTTGGTTGG AAATAAGGAC ATAATTAAGT TAAATCATTC TAAGAAACTT 1620 
PVMV LVG NKD IIKL NHS KKL 

GCTTCTTATT TTCCAAGGGG GGAGTTTTAT TCTTTAGTTG GCTTTGGGCA TCACATTATT 1680 
ASYF PRG EFY SLVG FGH HII 

AAGCAAGATT CCCATGTTTT TAATATTATT GCAAAAAAGT TTATCAACGA TACGTTGAAA 1740 
KQDS HVF Nil AKKF IND TLK 

GGAGAAATTG TTGAAAAAGC TAATTGAAAA AGTCAAATCA CTGACTTCTG TGATTAAAAT 1800 
GEIV EKA N 

TGTATTTTTT ATATCTGTTT TAGTGCTTAT TATTGTTGAA ATGATTCATT TGAAACGAAC 1860 

M I H L K R T 
| > 

TATTTCTGTT GAGCAACTAA AGAGTGTTTT TGGGCAATTA TCTCCAATGA ATCTTTTCTT 1920 
ISV EQLK SVF G QL SPMN L F L 

AATTATCCTT GTGGGGGTTA TCGCTGTCTT ACCGACAACC GGATATGACT TTGTACTGAA 1980 
IIL V G V I A V L PTT GYDF VLN 

TGGACTTTTA CGTACAGATA AAAGCAAAAG GTATATTTTA CAGACTAGTT GGTGTATCAA 2040 
GLLR TDKS KRY ILQ TSWC IN 

CACTTTTAAT AACTTGTCAG GATTCGGTGG CTTAATCGAT ATTGGGTTGC GCATGGCTTT 2100 
TFNN LSGF GGL IDI GLRM AF 

TTATGGTAAA AAAGGTCAAG AGAAGAGTGA CCTAAGAGAA GTGACTCGTT TTTTACCCTA 2160 
YGKK GQEK SDL REV TRFL PY 

TCTTATTTCT GGTCTGTCAT TTATTAGTGT GATTGCCTTA ATCATGAGCC ATATTTTTCA 2220 
LISG LSFI SVI A L I MSHI FH 

TGCCAAAGCT AGTGTTGATT ACTATTATTT GGTATTAATT GGTGCTAGTA TGTATTTTCC 2280 
AKAS VDYY YLV LIG ASMY FP 

TGTTATTTAT TGGATTTCTG GTCATAAAGG AAGCCATTAT TTCGGAGATA TGCCATCTAG 2340 
VIYW ISGH KGS HYF GDMP SS 

TACTCGTATA AAATTAGGTG TTGTTTCTTT TTTTGAATGG GGATGTGCGG CCGCAGCATT 2400 
TRIK LGVV SFF EWG C A A A AF 

TATAATTATC GGTTATTTAA TGGGCATTCA TCTACCAGTT TATAAAATTT TACCACTATT 24 60 
IIIG YLMG IHL PVY KILP LF 
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TTGTATTGGT TGTGCCGTCG GGATTGTATC CCTTATTCCC GGTGGATTAG GAAGTTTTGA 2520 
CIGC AVGI VSL IPG GLGS FE 

ATTAGTTCTA TTTACAGGGT TTGCTGCCGA GGGACTACCT AAAGAAACTG TGGTTGCATG 2580 
LVLF T G F A AEG LPK ETVV AW 

GTTATTACTT TATCGTTTAG CCTACTATAT TATTCCATTC TTTGCAGGTA TCTATTTCTT 2640 
L L L Y RLAY Y I I PFF AGIY FF 

TATCCATTAT TTAGGTAGTC AAATAAATCA ACGTTATGAA AATGTCCCGA AAGAGTTAGT 2700 
IHYL GSQI NQR YEN VPKE L V 

ATCAACTGTT CTACAAACCA TGGTGAGCCA TTTGATGCGT ATTTTAGGTG CATTCTTAAT 2760 
STVL QTMV SHL MRI LGAF LI 

| > 

ATTTTCAACA GCATTTTTTG AAAATATTAC TTATATTATG TGGTTGCAGA AGCTAGGCTT 2820 
FSTA FFEN ITY IMW LQKL GL 

GGACCCATTA CAAGAACAAA TGTTATGGCA GTTTCCAGGT TTATTGCTGG GGGTTTGTTT 2880 
DPLQ EQML WQF PGL LLGV CF 

TATTCTCTTA GCTAGAACTA TTGATCAAAA AGTGAAAAAT GCTTTTCCAA TTGCTATTAT 2940 
ILLA RTID QKV KNA FPIA II 

CTGGATTACT TTGACATTGT TTTATCTTAA TTTAGGTCAT ATTAGTTGGC GACTATCTTT 3000 
WITL TLFY LNL G H I. S W R L SF 

CTGGTTTATT TTACTATTGT TAGGCTTATT AGTCATTAAG CCAACTCTCT ATAAAAAACA 3060 
WFIL LLLG L L V IKP TLYK KQ 

ATTTATTTAT AGCTGGGAAG AGCGTATTAA GGATGGAATC ATTATCGTTA GTTTAATGGG 3120 
FIYS WEER IKD 6 11 IVSL MG 

AGTTCTATTT TATATTGCAG GACTACTATT CCCTATCAGG GCTCATATTA CAGGTGGTAG 3180 
VLFY IAGL LFP IRA HITG GS 

TATTGAACGC CTGCATTATA TCATAGCATG GGAGCCGATA GCATTGGCTA CGTTGATTCT 3240 
IERL HYII AWE PIA LATL I L 

TACTCTCGTT TATTTATGTT TGGTTAAGAT TTTACAAGGA AAATCTTGTC AGATTGGTGA 3300 
TLVY LCLV K I L QGK SCQI GD 

TGTGTTCAAT GTGGATCGTT ATAAAAAACT ACTTCAAGCT TACGGTGGTT CTTCGGATAG 3360 
VFNV DRYK KLL QAY G G S S DS 

CGGTTTAGCC TTTTTAAATG ATAAAAGGCT CTACTGGTAC CAAAAAAATG GAGAAGATTG 3420 
GLAF LND K RLY WYQ KNGE DC 

CGTTGCGTTC CAATTTGTAA TTGTCAATAA TAAATGTCTT ATTATGGGGG AACCAGCCGG 3480 
VAFQ FVIV NNK CLI MGEP AG 

TGATGACACT TATATTCGTG AAGCTATTGA ATCGTTTATT GATGATGCTG ATAAGCTAGA 354 0 
DDTY IREA IES FID DADK LD 

CTATGACCTT GTTTTTTACA GTATTGGACA GAAGTTGACA CTACTTTTAC ATGAGTATGG 3600 
YDLV FYSI GQK LTL LLHE YG 

TTTTGACTTT ATGAAAGTTG GTGAGGATGC TTTAGTTAAT TTAGAAACGT TTACTCTTAA 3660 
FDFM KVGE DAL VNL ETFT LK 
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AGGGAATAAG TACAAACCTT TCAGAAATGC CCTAAATAGA GTTGAAAAGG ATGGTTTCTA 3720 
GNKY K P F R NAL NRV EKDG FY 

TTTCGAAGTT GTACAATCGC CACATAGTCA AGAGCTACTA AATAGTTTGG AAGAGATTTC 3780 
FEVV QSPH SQE LLN SLEE IS 

TAATACTTGG TTAGAAGGAC GTCCTGAAAA AGGTTTCTCA CTAGGATATT TTAATAAAGA 3840 
NTWL EGRP E K G F S L GYFN KD 

TTATTTCCAA CAAGCCCCAA TAGCTTTGGT AAAAAATGCT GAACACGAAG TTGTTGCTTT 3900 
YFQQ APIA LVK NAE HEVV A F . 

TGCTAATATT ATGCCAAACT ATGAAAAGAG TATTATCTCT ATTGATTTAA TGCGTCACGA 3960 
ANIM PNYE KSI I 'SI DLMR HD 

TAAACAGAAA ATTCCGAATG GCGTTATGGA TTTCCTCTTT TTATCATTAT TCTCTTATTA 4020 
KQKI PNGV MDF LFL SLF S YY 

TCAAGAGAAG GGATACCACT ATTTTGATTT GGGGATGGCA CCTTTATCAG GAGTTGGTCG 4080 
QEKG YHYF DLG MAP LSGV GR 

CGTTGAAACA AGTTTTGCTA AAGAGAGAAT GGCGTATCTT GTCTATCATT TCGGTAGTCA 4140 
VETS FAKE RMA Y L V YHFG SH 

TTTCTACTCA TTTAATGGTT TACACAAGTA TAAGAAGAAG TTTACACCAT TGTGGTCGGA 4200 
FYSF NGLH KYK KKF TPLW SE 

ACGTTATATT TCTTGTTCTC GTTCGTCCTG GTTAATTTGT GCTATTTGTG CCCTATTAAT 4260 
RYIS CSRS SWL ICA ICAL LM 

GGAAGATAGT AAAATTAAGA TTGTTAAATA AGCTTTATTT GGCAATTAAA AAGAGCATGT 4320 
EDSK IKIV K 

CATGCGACAT GCTCTTTTTA AATCATTTAA TACCATTGAT TGCTTGAATC TACTTTATAA 4380 

TATGATGTGC TTTTAAATAT TGTTTAGCTA CTGTAGCTGC TGATTTATGC TTTACAGCTA 4 440 

CTTGGTAGTT CATTTCTTGC ATTTCTTTTT CAGTGATATG ACCAGCAAGT TTATTGAGAG 4500 

CTTTTTTTAC TTGA (SEQ ID N0:1) , 4514 

FIG. la 
[clonel-dna/aa] 
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SGKEPANRFS 


WAKNKLLING 


n UTT A I\TTT T7T7\ 1 rAT?T 


PDYPGKTYFI 


50 


ILLTAWTLMA 


LVTALVGWDN 


RYGSFLSLLI LLFQLGSSAG 


TYPIELSPKF 


100 


FOTTOPPT PM 
£ \d i lyre L xr rU 


TYSVSGLRET 


ISLTGDVNHQ WRMLVIFLVS 


SMILALLIYR 


150 




ID N0:2) 






154 






FIG. lb 






MSTLTIIIAT 


LTALEHFYIM YLETLATQSN MTGKIFSMSK 


EELSYLPVIK 


50 


LFKNQGVYNG 


LIGLFLLYGL 


YISQNQEIVA VFLINVLLVA 


IYGALTVDKK 


100 


ILLKQGGLPI 


LALLTFLF 


(SEQ ID NO:3) 




118 






FIG. lc 






• 

MTENWLHTKD 


GSDIYYRVVG 


QGQPIVFLHG NSLSSRYFDK QIAYFSKYYQ 


50 


VIVMDSRGHG 


KSHAKLNTIS 


FRQIAVDLKD ILVHLEIDKV 


ILVGHSDGAN 


100 


LALVFQTMFP 


GMVRGLLLNS 


GNLTIHGQRW WDILLVRIAY 


KFLHYLGKLF 


150 


PYMRQKAQVI 


SLMLEDLKIS 


PADLQHVSTP VMVLVGNKDI 


IKLNHSKKLA 


200 


SYFPRGEFYS 


LVGFGHHIIK QDSHVFNIIA KKFINDTLKG 


EIVEKAN 


247 


(SEQ ID N0:4) 









FIG. Id 



5/40 



WO 99/42588 



PCT/CA99/00114 



MIHLKRTISV 


EQLKSVFGQL 


SPMNLFLIIL 


VGVIAVLPTT 


GYDFVLNGLL 

VJ X \*t L. V XJ ll VJ Xi XJ 


SO 

■J u 


RTDKSKRYIL 


QTSWCINTFN 


NLSGFGGLID 


IGLRMAFYGK 


KGOEKSDLRF 


i nn 

lull 


VTRFLPYLIS 


GLSFISVIAL 


IMSHIFHAKA 


SVDYYYLVLI 


GA^MYFPVTY 
\jr\*jn i. e t v x X 


10 u 


WISGHKGSHY 


FGDMPSSTRI 


KLGVVSFFEW 


GCAAAAFITT 


vj i ijrivjxriijtr v 




YKILPLFCIG 


CAVGIVSLIP 


GGLGSFELVL 


FTGFAAEGLP 


KET VVAWT T T 


OCA 

zou 


YRLAYYIIPF 


FAGIYFFIHY 


LGSQINQRYE 


NVPKELVSTV 


LOTMV^HT.MR 


"*nn 


ILGAFLIFST 


AFFENITYIM 


WLQKLGLDPL 


OEOMLWOFPG 


LLLGVCFILT 

iJXJU VJ VUC X XiXj 


^n 


ARTIDQKVKN 


AFPIAIIWIT 


LTLFYLNLGH 


ISWRLSFWFI 


LLLLGLLVIK 


d nn 
*4 u u 


PTLYKKQFIY 


SWEERIKDGI 


IIVSLMGVLF 


YIAGLLFPIR 


AHITGGSIER 


a ^n 


LHYIIAWEPI 


ALATLILTLV 


YLCLVKILQG 


KSCQIGDVFN 


VDRYKKLLQA 


son 

JUU 


YGGSSDSGLA 


FLNDKRLYWY 


QKNGEDCVAF 


QFVIVNNKCL 


IMGEPAGDDT 


DDU 


YIREAIESFI 


DDADKLDYDL 


VFYSIGQKLT 


LLLHEYGFDF 


MKVGEDALVN 


ouu 


LETFTLKGNK 


YKPFRNALNR 


VEKDGFYFEV 


VQSPHSQELL NSLEEISNTW 


650 


LEGRPEKGFS 


LGYFNKDYFQ 


QAPIALVKNA 


EHEVVAFANI 


MPNYEKSIIS 


700 


IDLMRHDKQK 


IPNGVMDFLF 


LSLFSYYQEK 


GYHYFDLGMA 


PLSGVGRVET 


/ ou 


SFAKERMAYL 


VYHFGSHFYS 


FNGLHKYKKK 


FTPLWSERYI 


SCSRSSWLIC 


o UU 


AICALLMEDS 


KIKIVK (SEQ ID NO: 5) 






QIC 

bib 






FIG. 


le 






MRILGAFLIF 


STAFFENITY 


IMWLQKLGLD 


PLQEQMLWQF 


PGLLLGVCFI 


^n 


LLARTIDQKV 


KNAFPIAIIW 


ITLTLFYLNL 


GHISWRLSFW 


FILLLLGLLV 


i nn 

xUU 


IKPTLYKKQF 


IYSWEERIKD 


GIIIVSLMGV 


LFYIAGLLFP 


IRAHITGGSI 


i ^n 
10 u 


ERLHYIIAWE 


PIALATLILT 


LVYLCLVKIL 


QGKSCQIGDV 


FNVDRYKKLL 


o n n 
zuu 


QAYGGSSDSG 


LAFLNDKRLY 


WYQKNGEDCV 


AFQFVIVNNK 


CLIMGEPAGD 


£OU 


DTYIREAIES 


FIDDADKLDY 


DLVFYSIGQK 


LTLLLHEYGF 


DFMKVGEDAL 


300 


VNLETFTLKG 


NKYKPFRNAL 


NRVEKDGFYF 


EWQSPHSQE 


LLNSLEEISN 


350 


TWLEGRPEKG 


FSLGYFNKDY 


FQQAPIALVK 


NAEHEVVAFA 


NIMPNYEKSI 


400 


ISIDLMRHDK 


QKIPNGVMDF 


LFLSLFSYYQ 


EKGYHYFDLG 


MAPLSGVGRV 


450 


ETSFAKERMA 


YLVYHFGSHF 


YSFNGLHKYK 


KKFTPLWSER 


YISCSRSSWL 


500 


ICAICALLME 


DSKIKIVK 


(SEQ ID NO: 


6) 




518 






FIG. 


If 
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AATTTTGATA TCGAAACAAC AACTTTTGAG GCAATGAAAA AGCACGCGTC ATTATTGGAG 60 
N D I ET.T TFE AMKK HAS LLE 

AAAATATCTG TTGAGCGTTC TTTTATTGAA TTTGATAAAC TTCTATTAGC ACCTTATTGG 120 
K I S V ERS FIE FDKL LLA PYW 

CGTAAAGGAA TGCTGGCACT AATAGATAGT CATGCTTTTA ATTATCTACC ATGCTTAAAA 180 
RKGM LAL IDS HAFN YLP CLK 

AATAGGGAAT TACAATTAAG CGCCTTTTTG TCCCAGTTAG ATAAAGATTT TTTATTTGAG 240 
NREL QLS AFL SQLD KDF LFE 

ACATCAGAAC AAGCTTGGGC ATCACTCATC TTGAGTATGG AAGTTGAACA CACAAAGACT 300 
TSEQ AWA SLI LSME VEH TKT 

TTTTTAAAAA AATGGAAGAC ATCAACTCAC TTTCAAAAAG ATGTTGAGCA TATAGTGGAT 360 
F L K K W K T S T H F Q K D VEH I V D 

GTTTATCGTA TTCGTGAACA AATGGGATTG GCTAAAGAAC ATCTTTATCG TTATGGAAAA 420 
V Y R I REQ MGL AKEH LYR YGK 

ACTATAATAA AACAAGCGGA AGGTATTCGC AAAGCAAGAG GCTTGATGGT TGATTTCGAA 4 80 
TIIK QAE GIR KARG LMV DFE 

AAAATAGAAC AACTAGATAG TGAGTTAGCA ATCCATGATA GGCATGAGAT AGTTGTCAAT 540 
KIEQ LDS ELA IHDR H E I VVN 

GGTGGCACCT TAATCAAGAA ATTAGGAATA AAACCTGGTC CACAGATGGG AGATATTATC 600 
GGTL IKK LGI KPGP QMG DII 

TCTCAAATTG AATTAGCCAT TGTTTTAGGA CAACTGATTA ATGAAGAAGA GGCTATTTTA 660 
S Q I E LAI V L G Q L I N E E E AIL 

CATTTTGTTA AGCAGTACTT GATGGATTAG AGAGGATTAT ATGAGCGATT TTTTAGTAGA 720 
HFVKQYLMD MS DFLVD 

I > 

TGGATTGACT AAGTCGGTTG GTGATAAGAC GGTCTTTAGT AATGTTTCAT TTATCATCCA 780 
GLT KSVG DKT VFS NVSF IIH 

TAGTTTAGAC CGTATTGGGA TTATTGGTGT CAATGGAACT GGAAAGACAA CACTATTAGA 840 
SLD RIGI IGV NGT G K T • T LLD 

TGTTATTTCG GGTGAATTAG GTTTTGATGG TGATCGTTCC CCTTTTTCAT CAGCTAATGA 900 
VIS GELG FDG DRS PFSS AND 

TTATAAGATT GCTTATTTAA AACAAGAACC AGACTTTGAT GATTCTCAGA CAATTTTGGA 960 
YKI AYLK QEP DFD DSQT ILD 

CACCGTACTT TCTTCTGACT TAAGAGAGAT GGCTTTAATT AAAGAATATG AATTATTGCT 1020 
TVL SSDL REM A L I KEYE LLL 

TAATCACTAC GAAGAAAGTA AGCAATCACG TCTAGAGAAA GTAATGGCAG AAATGGATTC 1080 
NHY EESK QSR LEK VMAE MDS 

TTTAGATGCT TGGTCTATTG AGAGCGAAGT CAAAACAGTA TTATCCAAAT TAGGTATTAC 1140 
LDA WSIE SEV KTV LSKL GIT 

TGATTTGCAG TTGTCGGTTG GTGAATTATC AGGAGGATTA CGAAGACGTG TTCAATTAGC 1200 
DLQ LSVG ELS GGL RRRV QLA 
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GCAAGTATTA TTAAATGATG CAGATTTATT GCTCTTAGAC GAACCTACTA ACCACTTAGA 1260 
Q V L L N D A DLL L L D E P T N H L D 

TATTGACACT ATTGCATGGT TAACGAATTT TTTGAAAAAT AGTAAAAAGA CAGTGCTTTT 1 
IDT IAWL TNF L K N SKKT V L F 

TATAACTCAT GATCGTTATT TTCTAGACAA TGTTGCAACA CGTATTTTTG AATTAGATAA 1 1ftft 
ITH D R Y F LDN VAT RIFE LDK 

GGCACAGATT ACAGAATATC AAGGCAATTA TCAGGATTAT GTCCGACTTC GTGCAGAACA 14 A o 
A Q I TEYQ GNY QDY VRLR A E Q 

AGACGAGCGT GATGCTGCTA GTTTACATAA AAAGAAACAG CTTTATAAAC AGGAACTAGC 1500 
DER DAAS LHK KKQ LYKQ ELA 

TTGGATGCGT ACTCAGCCAC AAGCTCGTGC AACGAAACAA CAGGCTCGTA TTAATCGTTT 1560 
WMR TQPQ ARA TKQ Q A R I NRF 

TCAAAATCTA AAAAACGATT TACACCAAAC AAGCGATACA AGCGATTTGG AAATGACATT 1620 
QNL KNDL HQT SDT SDLE MTF 

TGAAACAAGT CGAATTGGGA AAAAGGTTAT TAATTTTGAA AATGTCTCTT TTTCTTACCC lfiflO 
ET S RIGK K V I NFE NVSF SYP 

AGATAAATCT ATCTTGAAAG ACTTTAATTT GTTAATTCAA AATAAAGACC GTATTGGCAT 1740 
DKS ILKD FNL LIQ NKDR IGI 

CGTTGGAGAT AATGGTGTTG GAAAGTCAAC CTTACTTAAT TTAATTGTTC AAGATTTACA 1800 
VGD N G V G KST LLN LIVQ DLQ 

GCCGGATTCG GGTAATGTCT CTATTGGTGA AACGATACGT GTAGGTTACT TTTCACAACA 1860 
P D S GNVS IGE TIR VGYF SQQ 

ACTTCATAAT ATGGATGGCT CAAAACGTGT TATTAATTAT TTGCAAGAGG TTGCAGATGA 1920 
L H N MDGS KRV INY LQEV ADE 

GGTTAAAACT AGTGTCGGTA CAACAAGTGT GACAGAACTA TTGGAACAAT TTCTCTTTCC 1980 
VKT SVGT TSV TE L LEQF LFP 

ACGTTCGACA CATGGAACAC AAATTGCAAA ATTATCAGGT GGTGAGAAAA AAAGACTTTA 2040 
R ST HGTQ IAK LSG GEKK RLY 

CCTTTTAAAA ATCCTGATTG AAAAGCCTAA TGTGTTACTA CTTGATGAGC CGACAAATGA 2100 
LL K I L I E KPN V L L LDEP TND 

CTTAGATATT GCTACATTAA CTGTTCTTGA AAATTTTTTA CAAGGCTTTG GTGGTCCTGT 2160 
LDI ATLT VLE NFL QGFG GPV 

GATTACAGTT AGTCACGATC GTTACTTTTT AGATAAAGTG GCTAATAAAA TTATTGCGTT 2220 
ITV SHDR YFL DKV ANKI IAF 

TGAAGATAAC GATATCCGTG AATTTTTTGG TAATTATACT GATTATTTAG ATGAAAAAGC 2280 
EDN DIRE FFG NYT DYLD E K A 

ATTTAATGAG CAAAATAATG AAGTTATCAG TAAAAAAGAG AGTACCAAGA CAAGTCGTGA 2340 
FNE QNNE VIS KKE STKT SRE 

AAAGCAAAGT CGTAAAAGAA TGTCTTACTT TGAAAAACAA GAATGGGCGA CAATTGAAGA 2400 
KQS RKRM SYF EKQ EWAT IED 

CGATATTATG ATATTGGAAA ATACTATCAC TCGTATAGAA AATGATATGC AAACATGTGG 2460 
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DIM I L E N TIT R I E NDMQ TCG 

TAGTGATTTT ACAAGGTTAT CTGATTTACA AAAGGAATTA GATGCAAAAA ATGAAGCACT 2520 
S D F T.RLS DLQ KEL DAKN EAL 

TCTAGAAAAG TATGACCGTT ATGAGTACCT TAGTGAGTTA GACACATGAT TATCCGTCCG 25B0 
LEK YDRY EYL SEL DTMI I R P 

I > 

ATTATTAAAA ATGATGACCA AGCAGTTGCA CAATTAATTC GACAAAGTTT ACGCGCCTAT 2640 
IIKN DDQ A V A QLIR Q S L RAY 

GATTTAGATA AACCTGATAC AGCATATTCA GACCCTCACT TAGATCATTT GACCTCATAC 2700 
DLDK PDT AYS DPHL DHL TSY 

TACGAAAAAA TAGAGAAGTC AGGATTCTTT GTCATTGAGG AGAGAGATGA GATTATTGGC 2760 
YEKI EKS GFF VIEE RDE IIG 

TGTGGCGGCT TTGGTCCGCT GAAAAATCTA ATTGCAGAGA TGCAGAAGGT GTACATTGCA 2820 
CGGF GPL KNL IAEM QKV YIA 

GAACGTTTCC GTGGTAAGGG GCTTGCTACT GATTTAGTGA AAATGATTGA AGTAGAAGCT 2880 
ERFR GKG LAT DLVK MIE V E A 

CGAAAAATTG GGTATAGACA ACTTTATTTA GAGACAGCCA GTACTTTGAG TAGGGCAACT 2940 
RKIGYRQLYL ETAS TLS RAT 

GCGGTTTATA AGCATATGGG ATATTGTGCC TTATCGCAAC CAATAGCAAA TGATCAAGGT 3000 
AVYK HMG YCA LSQP IAN DQG 

CATACAGCTA TGGATATTTG GATGATTAAA GATTTATAAG TTGAAAGTGG ATTAGTGAAC 3060 
HTAMDIWMIKDL 

ATGGATTAAT TATTTTGAGA TAAGAGGAAA GAAAAGGAGA CATATATGGC ATATATTTGG 3120 

MA Y I W 
| > 

TCTTATTTGA AAAGGTACCC CAATTGGTTA TGGCTTGATT TACTAGGAGC TATGCTTTTT 3180 
SYLK RYP NWL WLDL LGA M L F 

GTGACGGTTA TCCTAGGAAT GCCCACAGCC TTAGCGGGTA TGATTGATAA TGGCGTTACA 3240 
VTVI LGM PTA L A G M IDN GVT 

AAAGGTGATC GGACTGGAGT TTATCTGTGG ACGTTCATCA TGTTTATATT TGTTGTACTA 3300 
KGDR TGV Y LW TFIM FIF V V L 

GGTATTATTG GGCGTATTAC GATGGCTTAC GCATCTAGTC GCTTAACGAC AACAATGATT 3360 
GIIG RIT MAY ASSR LTT TMI 

AGAGATATGC GTAATGATAT GTATGCTAAG CTTCAAGAAT ACTCCCATCA TGAATATGAA 3420 
RDMR NDM YAK LQEY SHH E.YE 

CAGATAGGTG TATCTTCACT AGTGACACGT ATGACAAGCG ATACTTTTGT TTTGATGCAA 3480 
QIGV SSL VTR MTSD TFV LMQ 

TTTGCTGAAA TGTCTTTACG TTTAGGCCTA GTAACTCCTA TGGTAATGAT TTTTAGCGTG 3540 
FAEM SLR LGL VTPM VMI FSV 

GTTATGATAC TAATTACGAG TCCATCTTTG GCTTGGCTTG TAGCGGTTGC GATGCCTCTT 3600 
VMIL ITS PSL AWLV AVA MPL 

TTGGTAGGAG TCGTTTTATA TGTAGCTATA AAAACAAAAC CTTTATCTGA AAGACAACAG 3660 
LVGV VLY VAI KTKP LSE RQQ 
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ACTATGCTTG ATAAAATCAA TCAATATGTT CGTGAAAATT TAACAGGGTT ACGCGTTGTT 3720 
TMLD KIN QYV R E N L TGL RVV 

AGAGCCTTTG CAAGAGAGAA TTTTCAATCA CAAAAATTTC AAGTCGCTAA CCAACGTTAC 3780 
RAFA REN FQS QKFQ VAN QRY 

ACAGATACTT CAACTGGTCT TTTTAAATTA ACAGGGCTAA CAGAACCACT TTTCGTTCAA 1ft4 n 
TDTS TGL FKL TGLT E P L F V Q 

ATTATTATTG CAATGATTGT GGCTATCGTT TGGTTTGCTT TGGATCCCTT ACAAAGAGGT 3900 
I I I A M I V AIV WFAL DPL QRG 

GCTATTAAAA TAGGGGATTT AGTTGCTTTT ATCGAATATA GCTTCCATGC TCTCTTTTCA 3960 
AIKI GDL VAF IEYS FHA LFS 

TTTTTGCTAT TTGCCAATCT TTTTACTATG TATCCTCGTA TGGTGGTATC AAGCCATCGT 4020 
FLLF ANL FTM YPRM V V S SHR 

ATTAGAGAGG TGATGGATAT GCCAATCTCT ATCAATCCTA ATGCCGAAGG TGTTACGGAT 4080 
IREV MDM PIS INPN AEG VTD 

ACGAAACTTA AAGGGCATTT AGAATTTGAT AATGTAACAT TCGCTTATCC AGGAGAAACA 4140 
TKLK GHL EFD NVTF AYP GET 

GAGAGTCCCG TTTTGCATGA TATTTCTTTT AAAGCTAAGC CTGGAGAAAC AATTGCTTTT 4200 
ESPV LHD ISF KAKP GET IAF 

ATTGGTTCAA CAGGTTCAGG AAAATCTTCT CTTGTTAATT TGATTCCACG TTTTTATGAT 4260 
IGST GSG KSS L V N L IPR FYD 

GTGACACTTG GAAAAATCTT AGTAGATGGA GTTGATGTAA GAGATTATAA CCTTAAATCA 4320 
VTLG KIL VDG VDVR DYN LKS 

CTTCGCCAAA AGATTGGATT TATCCCCCAA AAAGCTCTTT TATTTACAGG GACAATAGGA 4380 
LRQK IGF IPQ K A L L FTG TIG 

GAGAATTTAA AATATGGAAA AGCTGATGCT ACTATTGATG ATCTTAGACA AGCGGTTGAT 44 40 
ENLK YGK ADA TIDD L RQ AVD 

ATTTCTCAAG CTAAAGAGTT TATTGAGAGT CACCAAGAAG CCTTTGAAAC GCATTTAGCT 4 500 
ISQA KEF IES HQEA FET H LA" 

GAAGGTGGGA GCAATCTTTC TGGGGGTCAA AAACAACGGT TATCTATTGC TAGGGCTGTT 4560 
EGGS NLS GGQ KQRL SIA RAV 

GTTAAAGATC CAGATTTATA TATTTTTGAT GATTCATTTT CTGCTCTCGA TTATAAGACA 4 620 
V K D P DL Y IFD DSFS ALD YKT 

GACGCTACTT TAAGAGCGCG TCTAAAAGAA GTAACCGGTG ATTCTACAGT TTTGATAGTT 4 680 
DATL RAR LKE V T G D STV LIV 

GCTCAAAGGG TGGGTACGAT TATGGATGCT GATCAGATTA TTGTCCTTGA TGAAGGCGAA 4740 
A Q R V GTI MDA DQII V L D EGE 

ATTGTCGGTC GTGGTACCCA CGCTCAATTA ATAGAAAATA ATGCTATTTA TCGTGAAATC 4800 
IVGR GTH AQL IENN A I Y REI 

GCTGAGTCAC AACTGAAGAA CCAAAACTTA TCAGAAGGAG AGTGATTGTA TGAGAAAAAA 4860 
AESQ LKN QNL SEGE MRKK 



10/40 



WO 99/42588 



PCT/CA99/00114 



ATCTGTTTTT TTGAGATTAT GGTCTTACCT AACTCGCTAC AAAGCTACTC TTTTCTTAGC 4 920 

S V F LRLW SYL TRY KATL FLA 

GATTTTTTTG AAAGTTTTAT CTAGTTTTAT .GAGTGTTCTG GAGCCTTTTA TTTTAGGGTT 4 980 

I F L KVLS S F M SVL EPFI LGL 

AGCGATAACA GAGTTGACTG CTAACCTTGT TGATATGGCT AAGGGAGTTT CTGGGGCAGA 5040 

AIT ELTA N L V DMA KGVS GAE 

ATTGAACGTT CCTTATATTG CTGGTATTTT GATTATTTAT TTTTTCAGAG GTGTTTTCTA SI 00 

LNV PYIA GIL I I Y FFRG V F Y 

TGAATTAGGT TCTTATGGCT CAAATT (SEQ ID NO: 7) 
ELGSYGSN 



FIG. 2a 
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NFDIETTTFE 


AMKKHASLLE 


KISVERSFIE 


FDKLLLAPYW 


RKGMLALIDS 


50 


HAFNYLPCLK 


NRELQLSAFL 


SQLDKDFLFE 


TSEQAWASLI 


LSMEVEHTKT 


100 


FLKKWKTSTH 


FQKDVEHIVD 


VYRIREQMGL AKEHLYRYGK TIIKQAEGIR 


150 


KARGLMVDFE 


KIEQLDSELA 


IHDRHEIWN 


GGTLIKKLGI 


KPGPQMGDII 


200 


SQIELAIVLG 


QLINEEEAIL 


HFVKQYLMD 


(SEQ ID NO:8) 


229 






FIG. 2b 






MSDFLVDGLT 


KSVGDKTVFS 


NVSFIIHSLD 


RIGIIGVNGT 


GKTTLLDVIS 


50 


GELGFDGDRS 


PFSSANDYKI 


AYLKQEPDFD 


DSQTILDTVL 


SSDLREMALI 


100 


KEYELLLNHY 


EESKOSRLEK 


VMAEMDSLDA 


WSIESEVKTV 


LSKLGITDLQ 


150 


LSVGELSGGL 


RRRVOLAOVL 


LNDADLLLLD 


EPTNHLDIDT 


IAWLTNFLKN 


200 


SKKTVLFITH 


DRYFLDNVAT 


RIFELDKAQI 


TEYQGNYQDY VRLRAEQDER 


250 


DAASLHKKKQ 


LYKOELAWMR 


TQPQARATKQ 


QARINRFQNL 


KNDLHQTSDT 


300 


SDLEMTFETS 


RIGKKVINFE 


NVSFSYPDKS 


ILKDFNLLIQ 


NKDRIGIVGD 


350 






GNVSIGETIR 


VGYFSQQLHN 


MDGSKRVINY 


400 


LQEVADEVKT 


SVGTTSVTEL 


LEQFLFPRST 


HGTQIAKLSG 


GEKKRLYLLK 


450 


ILIEKPNVLL 


LDEPTNDLDI 


ATLTVLENFL 


QGFGGPVITV 


SHDRYFLDKV 


500 


ANKIIAFEDN 


DIREFFGNYT 


DYLDEKAFNE 


QNNEVISKKE 


STKTSREKQS 


550 


RKRMSYFEKQ 


EWATIEDDIM 


ILENTITRIE 


NDMQTCGSDF TRLSDLQKEL 


600 


DAKNEALLEK 


YDRYEYLSEL 


DT (SEQ ID NO: 9) 




622 



FIG. 2c 



MIIRPIIKND DQAVAQLIRQ SLRAYDLDKP DTAYSDPHLD HLTSYYEKIE 50 

KSGFFVIEER DEIIGCGGFG PLKNLIAEMQ KVYIAERFRG KGLATDLVKM 100 

IEVEARKIGY RQLYLETAST LSRATAVYKH MGYCALSQPI ANDQGHTAMD 150 

IWMIKDL (SEQ ID NO: 10) 157 

FIG. 2d 
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MAYIWSYLKR 


YPNWLWLDLL 


GAMLFVTVIL 


GMPTALAGMI 


DNGVTKGDRT 




GVYLWTFIMF 


IFVVLGIIGR 


ITMAYASSRL 


TTTMIRDMRN 


DMYAKLQEYS 


1UU 


HHEYEQIGVS 


SLVTRMTSDT 


FVLMQFAEMS 


LRLGLVTPMV MIFSWMILI 


ICO 


TSPSLAWLVA 


VAMPLLVGVV 


LYVAIKTKPL 


SERQQTMLDK 


INQYVRENLT 


o a rv 
200 


GLRVVRAFAR 


ENFQSQKFQV 


ANQRYTDTST 


GLFKLTGLTE 


PLFVQIIIAM 


o c r\ 

250 


IVAIVWFALD 


PLQRGAIKIG 


DLVAFIEYSF 


HALFSFLLFA 


NLFTMYPRMV 


*3 n r% 


VSSHRIREVM 


DMPISINPNA 


EGVTDTKLKG 


HLEFDNVTFA 


YPGETESPVL 


350 


HDISFKAKPG 


ETIAFIGSTG 


SGKSSLVNLI 


PRFYDVTLGK 


ILVDGVDVRD 


400 


YNLKSLRQKI 


GFIPQKALLF 


TGTIGENLKY 


GKADATIDDL 


RQAVDISQAK 


450 


EFIESHQEAF 


ETHLAEGGSN 


LSGGQKQRLS 


IARAVVKDPD 


LYIFDDSFSA 


500 


LDYKTDATLR 


ARLKEVTGDS 


TVLIVAQRVG 


TIMDADQIIV 


LDEGEIVGRG 


550 


THAQLIENNA 


IYREIAESQL 


KNQNLSEGE 


(SEQ ID NO: 


11) 


579 






FIG. 2e 






MRKKSVFLRL 


WSYLTRYKAT 


LFLAIFLKVL 


SSFMSVLEPF 


ILGLAITELT 


50 


ANLVDMAKGV 


SGAELNVPYI 


AGILIIYFFR 


GVFYELGSYG 


SN 


92 


(SEQ ID NO: 


12) 











FIG. 2f 
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AATTTGGAAG TGCTCTATCA ACAGTTGAAG TAAAGGAGAT TATTAGTGAA GAAAACATAT 60 
FGS ALS TVEV K E I ISE ENIW 

GGTTATATCG GCTCAGTTGC TGCCATTTTA CTAGCTACTC ATATTGGAAG TTACCAACTT 120 
LYR LSC C H F T SY- S YWK LPTW 

GGTAAGCATC ATATGGGTCT AGCAACAAAG GACAATCAGA TTGCCTATAT TGATGACAGC 180 
MGL ATK DNQI A Y I DDS 
I > 

AAAGGTAAGG CAAAAGCCCC TAAAACAAAC AAAACGATGG ATCAAATCAG TGCTGAAGAA 240 
KGKA KAP KTN KTMD QIS AEE 

GGCATCTCTG CTGAACAGAT CGTAGTCAAA ATTACTGACC AAGGCTATGT GACCTCACAC 300 
G I S A E Q I VVK ITDQ GYV TSH 

GGTGACCATT ATCATTTTTA CAATGGGAAA GTTCCTTATG ATGCGATTAT TAGTGAAGAG 360 
GDHY HFY NGK VPYD All SEE 

TTGTTGATGA CGGATCCTAA TTACCGTTTT AAACAATCAG ACGTTATCAA TGAAATCTTA 420 
L L M T DPN YRF KQSD VIN EIL 
I > 

GACGGTTACG TTATTAAAGT CAATGGCAAC TATTATGTTT ACCTCAAGCC AGGTAGTAAG 480 
DGYV IKV NGN YYVY LKP GSK 

CGCAAAAACA TTCGAACCAA ACAACAAATT GCTGAGCAAG TAGCCAAAGG AACTAAAGAA 540 
RKNI RTK QQI AEQV AKG TKE 

GCTAAAGAAA AAGGTTTAGC TCAAGTGGCC CATCTCAGTA AAGAAGAAGT TGCGGCAGTC 600 
AKEK GLA Q V A HLSK E EV AAV 

AATGAAGCAA AAAGACAAGG ACGCTATACT ACAGACGATG GCTATATTTT TAGTCCGACA 660 
NEAK RQG RYT TDDG YIF SPT 

GATATCATTG ATGATTTAGG AGATGCTTAT TTAGTACCTC ATGGTAATCA CTATCATTAT 720 
D I I D DLG DAY LVPH GNH YHY 

ATTCCTAAAA AGGATTTGTC TCCAAGTGAG CTAGCTGCTG CACAAGCCTA CTGGAGTCAA 780 
IPKK DLS PSE L A A A QAY WSQ 

AAACAAGGTC GAGGTGCTAG ACCGTCTGAT TACCGCCCGA CACCAGCCCC AGGTCGTAGG 840 
KQGR GAR PSD YRPT PAP GRR 

AAAGCCCCAA TTCCTGATGT GACGCCTAAC CCTGGACAAG GTCATCAGCC AGATAACGGT 900 
KAPI PDV TPN PGQG HQP DNG 

GGCTATCATC CAGCGCCTCC TAGGCCAAAT GATGCGTCAC AAAACAAACA CCAAAGAGAT 960 
GYHP APP RPN DASQ NKH QRD 

GAGTTTAAAG GAAAAACCTT TAAGGAACTT TTAGATCAAC TACACCGTCT TGATTTGAAA 1020 
E F K G KTF KEL LDQL HR L DLK 

TACCGTCATG TGGAAGAAGA TGGGTTGATT TTTGAACCGA CTCAAGTGAT CAAATCAAAC 1080 
YRHV EED GLI FEPT QVI KSN 

GCTTTTGGGT ATGTGGTGCC TCATGGAGAT CAT TAT CAT A TTATCCCAAG AAGTCAGTTA 1140 
AFGY VVP HGD HYHI IPR SQL 

TCACCTCTTG AAATGGAATT AGCAGATCGA TACTTAGCTG GCCAAACTGA GGACAATGAC 1200 
SPLE MEL ADR YLAG QTE DND 

TCAGGTTCAG AGCACTCAAA ACCATCAGAT AAAGAAGTGA CACATACCTT TCTTGGTCAT 1260 
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SGSE HSK PSD KEVT H . T F LGH 

CGCATCAAAG CTTACGGAAA AGGCTTAGAT GGTAAACCAT ATGATACGAG TGATGCTTAT 1320 
RIKA YGK G L D G K P Y DTS DAY 

GTTTTTAGTA AAGAATCCAT TCATTCAGTG GATAAATCAG GAGTTACAGC TAAACACGGA 1380 
VFSK ESI HSV DKSG VTA KHG 

GATCATTTCC ACTATATAGG ATTTGGAGAA CTTGAACAAT ATGAGTTGGA TGAGGTCGCT 1 440 
DHFH YIG FGE LEQY ELD EVA 

AACTGGGTGA AAGCAAAAGG TCAAGCTGAT GAGCTTGCTG CTGCTTTGGA TCAGGAACAA 1500 
nwvk AKG QAD ELAA ALD QEQ 

GGCAAAGAAA AACCACTCTT TGACACTAAA AAAGTGAGTC GCAAAGTAAC AAAAGATGGT 1560 
G K E K PLF DTK KVSR KVT KDG 

AAAGTGGGCT ATATGATGCC AAAAGATGGT AAGGACTATT TCTATGCTCG TGATCAACTT 1620 
KVGY MMP KDG KDYF YAR DQL 

GATTTGACTC AGATTGCCTT TGCCGAACAA GAACTAATGC TTAAAGATAA GAAGCATTAC 1680 
DLTQ IAF A EQ ELML KDK KHY 

CGTTATGACA TTGTTGACAC AGGTATTGAG CCACGACTTG CTGTAGATGT GTCAAGTCTG 1740 
R Y D I VDT G I E PRLA VDV SSL 

CCGATGCATG CTGGTAATGC TACTTACGAT ACTGGAAGTT CGTTTGTTAT CCCACATATT 1800 
r M H A GNA TYD TGSS FVI PHI 

GATCATATCC ATGTCGTTCC GTATTCATGG TTGACGCGCG ATCAGATTGC AACAGTCAAG 1860 
DHIH VVP YSW LTRD QIA TVK 

TATGTGATGC AACACCCCGA AGTTCGTCCG GATGTATGGT CTAAGCCAGG GCATGAAGAG 1920 
YVMQ HPE VRP DVWS KPG HEE 

TCAGGTTCGG TCATTCCAAA TGTTACGCCT CTTGATAAAC GTGCTGGTAT GCCAAACTGG 1980 
SGSV IPN VTP LDKR AGM PNW 

CAAATTATCC ATTCTGCTGA AGAAGTTCAA AAAGCCCTAG CAGAAGGTCG TTTTGCAACA 2040 
QIIH SAE EVQ KALA EGR FAT 

CCAGACGGCT ATATTTTCGA TCCACGAGAT GTTTTGGCCA AAGAAACTTT TGTATGGAAA 2100 
PDGY IFD PRD VLAK ETF VWK 

GATGGCTCCT TTAGCATCCC AAGAGCAGAT GGCAGTTCAT TGAGAACCAT TAATAAATCT 2160 
SIP RAD GSSL RTI NKS 

GATCTATCCC AAGCTGAGTG GCAACAAGCT CAAGAGTTAT TGGCAAAGAA AAATACTGGT 2220 
DLSQ AEW QQA QELL AKK NTG 

GATGCTACTG ATACGGATAA ACCCAAAGAA AAGCAACAGG CAGATAAGAG CAATGAAAAC 2280 
Urtiu PKE KQQA DKS NEN 

CAACAGCCAA GTGAAGCCAG TAAAGAAGAA AAAGAATCAG ATGACTTTAT AGACAGTTTA 2340 
QQPS EAS KEE KESD DFI DSL 

CCAGACTATG GTCTAGATAG AGCAACCCTA GAAGATCATA TCAATCAATT AGCACAAAAA 2400 
LDR ATL EDHI NQL AQK 

GCTAATATCG ATCCTAAGTA TCTCATTTTC CAACCAGAAG GTGTCCAATT TTATAATAAA 2460 
PKY LIF QPEG VQF YNK 

15/40 



WO 99/42588 PCT/CA99/001 1 4 

AATGGTGAAT TGGTAACTTA TGATATCAAG ACACTTCAAC AAATAAACCC TTAACCAAAA 2520 
NGEL VTY D I K TLQQ I N P 

GAAGATCTCA TTGTTAAAGC ACTGCTTTGT CAAAGCAAGT TACGGTGATT TTGAAGTCAT 2580 

TCTATGTAAC GAGTAGTGAT AAAAGTTGGA TAATAGCGGT TTTCTTTTGC AAAGAAATGG 2640 

TATCCATGTT AGAATAGTAA AAAAAGAGGA GGATTCTTGG ACTAATGTCA AATAAGTAGA 2700 

CAGAAAACTG TGTTATTTTA TTGCGTTAAA ATAATTTTCT TCTTTCTGAT TAGGGGTTAG 2760 
«K I A N F Y N E EKQ N P T L 

TCCTAGATTA GCCGTATGTG GGTTGTAATT GTTATAAAAA TTCTCAATGT ATTCAAAGCA 2820 
G L N ATH PNYN N Y F NEI YEFC 

GTCTAATTGA ACCTGTTTGA TATTTTGATA ATGTTTTCGG TTGATTTGTC TATGCTTTAA 2880 
DL 'Q VQK INQY HKR N I Q RHKL 

ATACTTGAAA AATGCTTCAG TTACGGCATT ATCATAAGGA TATCCAGGAT TAGAAAAAGA 2940 
YKF FAE TVAN DYP YGP NSFS 

ATGCATGATA TTGGCACTGC ACCCTAATAG TGAGACGCAA GAAAAACACT TTTAGGCAAT 3000 



H M 

< 



CAGTTTTCTG TACTGTACAG GCGACTGGTC GTTTAATCTC TGTTGAATTC TAGTTTCATT 3060 
L K R YQV PSQD N L R QQI RTEN 

ATAAAATGTA ATGTAATTTT TAACAATATT TGTTATACTA TCTTTGTTGT ATTTTCTCCT 3120 
YFT IYN KVIN TIS DKN YKRR 

ATTATGGAAA TAAAAGGTTT CAGTCTTTAG GACGGTGTGA AACCATTCAA TACAGGCATT 3180 
N H F YFT ETKL VTH FWE ICAN 

ATCTGCAGGT GTTCCTTTTC GAGACATTGA GCGGATAATG TCTTTTTCCG TGCAAGCCTG 3240 
DAP TGK RSMS RII DKE TCAQ 

GTAGTAAGCC ATAGAAGTAT ACACTGAGCC TTGGTCACTG TGTAAGATTG CTCCTTTATT 3300 
Y Y A M 
< 1 

TAGGCAATTT TAACTGATTA AGGGTGTCTA GTACAAAATC CGTGTCCTGA CAATCTGAGA 3360 
KPLK LQN LTD LVFD TDQ CDS 

TAGTGTAAGC TATAATTTCT CGGTTATAGA GATTCATAAT TGATGAGAGA TACAATTTAC 3420 
ITYA IIE RNY LNMI SSL YLK 

AGTTACCGAA ATATAGGTAG GTAATATCTG TTACGAGCTT TTCCTTAGGC TTATCGGCAT 3480 
CNGF YLY TID TVLK EKP KDA 

GGAAATCCCG ACTCAATTTA TTATCTGTTA AATAATAAGC TTTACCCAAA TTGGGAACTT 3540 
HGDR SLK NDT LYYA KGL NPV 

TCTTGGTACG TGTCCGACAA AGCCAGCCAT TATTTTTCAT GATACGATAG ACTTTCTTTG 3600 
KKTR TRC LWG NNKM IRY VKK 

TATTAACAGT CAATCCGTGG ATTTTTTTGA GCAATCGTGT AATGGTACGA TAGCCATAAA 3660 
TNVT LGH IKK LLRT ITR YGY 

TAAAGTGATT CTCCATACAG AGCTGTTCAA TTAATTCAAT AAGGTCATCT TTTTTTGCGG 3720 
I F H N E M 

< — | 
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CTTCTCATAC 


TCCTTTTTCC 


AACGGTAATA 


AATGAAAACT 


ATCGGGTAGT 


TGTTTTTATA 


ATCGATTTCC 


TTATCAAGCC 


TCGATACTTT 


I S K 


R I L G 


R Y K 


TCCACTTCAG 


ACAGATGTTC 


CAAGCCTTTA 


EVE 


S L H E 


L G K 


TGAAAACGAT 


AAAGCTCCTC 


GTTTTCGTAC 


u r* d 


Y L E E 


N E Y 


TTTTTGATGC 


CTAAAGTCTC 


CATAATAACT 


N K I 


G L T E 


M I V 


ATGCAAGCCA 


GCTTAGTTTC 


CCATGAATAT 


I C A 


L K T E 


W S Y 


TTCTAGTTTA 


CTAAATTTCA 


ACAGGAGTGT 


GCCTATTGTT 


GTCATCAATT 


ATTTTTCTAA 


CGGAATGAAA 


GAGAAGTGTT 


CCTTCAATCT 


TCTGTACAAC 


ATTTATAAAG 


TGTTTTTCTA 


GTAGTTGAGA 


CTACCATGAA 


TGCGGTGGTA 


Y N L 


S G H 


I R H Y 


AAGAGCTAGT 


TCTTCCAGCA 


ATTGAAAGGT 


L A L 


EEL 


L Q F T 


ACGATACGTA 


CTTTCAGCTA 


CGGCATTGTC 


R Y T 


SEA 


V A N D 


TGTGATTCCA 


AAGGCTTCCA 


ATATTTCATC 

* * * * * A A A A W 


TIG 


F A E 


LIED 


ATCTGAATGG 


AACATCTTGA 


CTTTGGTPAR 


D S H 


F M K 


V K T L 


AACGAGTTCA 


GCGGTCTTGT 


GCCAACCAAft 


V L E 


A T K 


H W G L 


GTCAATGATG 


AGGCAAACAT 


AAGCCCAACC; 


D I I 


LCV 


Y A W R 


GACTAAGGCT 


TGTAGTGGTC 


TTTCTTGCTT 


V L A 


Q L P 


R E Q K 


GGCTTCATTC 


TTGCCTCTAG 


AATGTGGTTT 


A E N 


K G R 


S H P K 


ATTGAGTCGC 


TTCATAATGC 


GTCGAATCCG 


N L R 


K M I 


R R I R 



CAAGCATATT TTGATTTTTC TGGATCCGTA 
LCI KIK RSGY 



GGTCGACCGC TTGACCTTAA AACAGTCTAG 3780 

GTCTTCCACA AGCTTGATAA GACTTACTTT 3840 

. K 

TTTAAGAGGT CAACCTGTAA TTGTAATTGT 3900 
KLL DVQL QLQ 

CCGTAGGTAT ATTGCTTGCC AACACCTTGA 3960 
GYTYQKGVGQ 

CATTTCATCC AAGTATAGAT TTGACTATTA 4 020 
WKM WTYI QSN 

CTGTTAGACT TGCCTGCTTT CTTCATATCG 4080 
RNS KGAK KMD 

GCTTTTTTAA CCATAATAAA ACATTCCTGT 4140 
A K K V M 

< 1 

TTTTCTTTTG TCTCATTTTA GGGATTCAGT 4200 

ATTCCCCGGA CTTAAATTGT GACCCTTGGT 4260 

TTCTTTTATT AAGTGAAAAG GCAACACTTT 4320 

GGCAATTAAT CTTTTAGTCA TTGGTGTTTG 4380 
.AIL RKT MPTQ 

ATTCCACCAA TGAACATAGT CTTTAGTCTT 4440 
NWW HVY DKTK 

TTCTTGATAA ACAAATTCAA TTTTGAAAGC 4500 
EQY VFE IKFA 

ATAAGGATAA CCAGCCTGAC TAAGCGAACG 4560 
YPY GAQ SLSR 

AATTAACTGA TTATCAAACT CTTTGCCACG 4620 
ILQ NDF EKGR 

GGCGTAAGGG ATGCTTTGTA TGGCTTGCTT 4 680 
AYP ISQ IAQK 

AGACAGGCCG ATGATTTCAC GGTTGTATAG 4740 
SLG I IE R N Y L 

ATTGCCTACA CGAACATAGG TTAAGTCAGT 4 800 
NGV RVY TLDT 

AAATTGCCTG TCTAAGTGGT TGGGAATAGG 4860 
FQR DLH NPIP 

GAAGGTGGCT TTCTGATAAA CAGAAACCAA 4920 
FTA KQY VSVL 

ACGACGTGAA AGTGTGATAC CTTCGTTATT 4980 
R R S LTI GENN 

TCTAGACTCG CTATCGAGAA AAATTCTTTT 5040 
RSE SDL FIRK 
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AATAGTTTCT TCAAACTCCG TTTCAGATAC TGACTCCACG GCTTGATAGT AATAACTTGA 5100 
ITE EFE TESV .SEV A Q Y Y Y S S 

GTGTGGCATA TTCAGCCAGC GACACATCTT TGAAATGCTG TATTTATCCT TATTAGCAGT 5160 
HPM NLW RCMK SIS YKD KNAT 

GATTATTTCC CTTTTTGTGC CATAATCACC GCTGCTTGCT TTAGGATATC TAATT S?1 q 

I I E RKT GYDG SSA KPY R I 
(SEQ ID N0:13) < | 

FIG. 3a 



FGSALSTVEV KEIISEENIW LYRLSCCHFT SYSYWKLPTW 40 
(SEQ ID NO: 14) 

FIG. 3b 



MGLATKDNQI AYIDDSKGKA KAPKTNKTMD QISAEEGISA EQIWKITDQ 50 

GYVTSHGDHY HFYNGKVPYD AIISEELLMT DPNYRFKQSD VINEILDGYV 100 

IKVNGNYYVY LKPGSKRKNI RTKQQIAEQV AKGTKEAKEK GLAQVAHLSK 150 

EEVAAVNEAK RQGRYTTDDG YIFSPTDIID DLGDAYLVPH GNHYHYIPKK 200 

DLSPSELAAA QAYWSQKQGR GARPSDYRPT PAPGRRKAPI PDVTPNPGQG 250 

HQPDNGGYHP APPRPNDASQ NKHQRDEFKG KTFKELLDQL HRLDLKYRHV 300 

EEDGLIFEPT QVIKSNAFGY WPHGDHYHI IPRSQLSPLE MELADRYLAG 350 

QTEDNDSGSE HSKPSDKEVT HTFLGHRIKA YGKGLDGKPY DTSDAYVFSK 400 

ESIHSVDKSG VTAKHGDHFH YIGFGELEQY ELDEVANWVK AKGQADELAA 450 

ALDQEQGKEK PLFDTKKVSR KVTKDGKVGY MMPKDGKDYF YARDQLDLTQ 500 

IAFAEQELML KDKKHYRYDI VDTGIEPRLA VDVSSLPMHA GNATYDTGSS 550 

FVIPHIDHIH WPYSWLTRD QIATVKYVMQ HPEVRPDVWS KPGHEESGSV 600 

IPNVTPLDKR AGMPNWQIIH SAEEVQKALA EGRFATPDGY I FDPRDVLAK 650 

ETFVWKDGSF SIPRADGSSL RTINKSDLSQ AEWQQAQELL AKKNTGDATD 700 

TDKPKEKQQA DKSNENQQPS EASKEEKESD DFIDSLPDYG LDRATLEDHI 750 

NQLAQKANID PKYLIFQPEG VQFYNKNGEL VTYDIKTLQQ INP 793 
(SEQ ID NO:15) 

FIG. 3c 
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MTDPNYRFKQ 


SDVINEILDG 


YVIKVNGNYY 


VYLKPGSKRK 


NIRTKQQIAE 


50 


QVAKGTKEAK 


EKGLAQVAHL 


SKEEVAAVNE 


AKRQGRYTTD 


DGYIFSPTDI 


100 


IDDLGDAYLV 


PHGNHYHYIP 


KKDLSPSELA 


AAQAYWSQKQ 


GRGARPSDYR 


150 


PTPAPGRRKA 


PIPDVTPNPG 


QGHQPDNGGY 


HPAPPRPNDA 


SQNKHQRDEF 


200 


KGKTFKELLD 


QLHRLDLKYR 


HVEEDGLIFE 


PTQVIKSNAF 


GYVVPHGDHY 


250 


HIIPRSQLSP 


LEMELADRYL 


AGQTEDNDSG 


SEHSKPSDKE 


VTHTFLGHRI 


300 


KAYGKGLDGK 


PYDTSDAYVF 


SKESIHSVDK 


SGVTAKHGDH 


FHYIGFGELE 


350 


QYELDEVANW 


VKAKGQADEL 


AAALDQEQGK 


EKPLFDTKKV 


SRKVTKDGKV 


400 


GYMMPKDGKD 


YFYARDQLDL 


TQIAFAEQEL 


MLKDKKHYRY 


DIVDTGIEPR 


450 


LAVDVSSLPM 


HAGNATYDTG 


SSFVIPHIDH 


IHVVPYSWLT 


RDQIATVKYV 


500 


MQHPEVRPDV WSKPGHEESG 


SVIPNVTPLD 


KRAGMPNWQI 


IHSAEEVQKA 


550 


LAEGRFATPD 


GYIFDPRDVL 


AKETFVWKDG 


SFSIPRADGS 


SLRTINKSDL 


600 


SQAEWQQAQE 


LLAKKNTGDA 


TDTDKPKEKQ 


QADKSNENQQ 


PSEASKEEKE 


650 


SDDFIDSLPD 


YGLDRATLED 


HINQLAQKAN 


IDPKYLIFQP EGVQFYNKNG 


700 


ELVTYDIKTL 


QQINP (SEQ ID NO: 16) 






715 






FIG. 3d 






MHSFSNPGYP 


YDNAVTEAFF 


KYLKHRQINR KHYQNIKQVQ LDCFEYIENF 


50 


YNNYNPHTAN 


LGLTPNQKEE 


NYFNAIK (SEQ ID NO: 17) 


77 






FIG. 3e 






MAYYQACTEK 


DIIRSMSRKG 


TPADNACIEW 


FHTVLKTETF 


YFHNRRKYNK 


50 


DSITNIVKNY 


ITFYNETRIQ 


QRLNDQSPVQ YRKLIA (SEQ ID NO: 18) 


86 






FIG. 3f 






MENHFIYGYR 


TITRLLKKIH 


GLTVNTKKVY 


RIMKNNGWLC 


RTRTKKVPNL 


50 


GKAYYLTDNK 


LSRDFHADKP 


KEKLVTDITY 


LYFGNCKLYL 


SSIMNLYNRE 


100 


IIAYTISDCQ 


DTDFVLDTLN 


QLKLPK (SEQ ID NO: 19) 




126 



FIG. 3g 
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MVKKAYSWET KLACIDMKKA GKSNRVIMET LGIKNNSQIY TWMKWYENEE 50 
LYRFHQGVGK QYTYGKGLEH LSEVEQLQLQ VDLLKKYRGL IRKSIK 96 
(SEQ ID NO:20) 

FIG. 3h 



IRYPKASSGD YGTKREIITA NKDKYSISKM CRWLNMPHSS YYYQAVESVS 50 
ETEFEETIKR IFLDSESRYG SRKIKICLNN EGITLSRRRI RRIMKRLNLV 100 
SVYQKATFKP HSRGKNEAPI PNHLDRQFKQ ERPLQALVTD LTYVRVGNRW 150 
AYVCLIIDLY NREIIGLSLG WHKTAELVKQ AIQSIPYALT KVKMFHSDRG 200 
KEFDNQLIDE ILEAFGITRS LSQAGYPYDN AVAESTYRAF KIEFVYQETF 250 
QLLEELALKT KDYVHWWNYH RIHGSLNYQT PMTKRLIA (SEQ ID NO: 21) 288 



FIG. 3i 



AATTTGAAAG CAGAATTATC TGTAGAAGAT GAGCAATATA CAGCAACAGT TTATGGTAAA 60 
NLKA ELS VED E Q Y T ATV YGK 

TCTGCTCATG GTTCAACACC ACAAGAAGGT GTTAATGGGG CGACTTATTT AGCTCTTTAT 120 
SAHG S T P QEG VNGA TYL A L Y 

CTAAGTCAAT TTGATTTTGA AGGTCCTGCT CGTGCTTTCT TAGATGTTAC AGCCAACATT 180 
L S Q F D F E G P A RAFL DVT ANI 

ATTCACGAAG ACTTCTCAGG TGAAAAACTT GGAGTAGCTT ATGAAGATGA CTGTATGGGA 240 
IHE D FSG EKL G V A Y EDD CMG 

CCATTGAGCA TGAATGCAGG TGTCTTCCAG TTTGATGAAA CTAATGATGA TAATACTATC 300 
PLSM NAG VFQ FDET NDD NTI 

GCTCTTAATT TCCGTTACCC ACAAGGGACA GATGCTAAAA CTATCCAAAC TAAGCTTGAG 360 
A L N F RYP QGT DAKT IQT-KLE 

AAACTTAACG GAGTTGAAAA AGTGACTCTT TCTGACCATG AACACACACC ACACTATGTA 420 
KLNG VEK VTL SDHE HTP HYV 

CCTATGGACG ATGAATTAGT ATCAACCTTA CTAGCTGTCT ATGAAAAGCA AACTGGTCTT 480 
PMDD ELV STL LAVY EKQ TGL 

AAAGGACATG AACAGGTTAT TGGTGGTGGG ACATTTGGTC GCTTACTTGA ACGGGGTGTT 540 
KGHE Q V I GGG TFGR LLE RGV 

GCATACGGTG CCATGTTCCC AGGAGATGAA AACACTATGC ATCAAGCTAA TGAGTACATG 600 
AYGA MFP GDE NTMH QAN EYM 

CCTTTAGAAA ATATTTTCCG TTCGGCTGCT ATCTACGCAG AAGCTATCTA TGAATTAATC 660 
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P L E N I F R S A A IYAE AIY ELI 
AAATAAAATA ATCCTTAAAC TAAATATGTG ATCAATGATA AAGGGTGGTG AAGACATGAA 720 

AGTGTCTTTG CCTCTTTTCA TAAGGTTAGA TTTGGAGACT TTATGACTGA CTTGGAAAAA 780 

M T D L E K 

ATTATTAAAG CAATAAAAAG TGATTCACAG AATCAAAATT ATACAGAAAA TGGTATTGAT 84 0 
I I K A IKS DSQ N Q N Y TEN GID 

CCTTTGTTTG CTGCTCCTAA AACAGCTAGG ATCAATATTG TTGGCCAAGC ACCTGGTTTA 900 
P L F A A P K TAR INIV GQA P G L 

AAAACTCAAG AAGCAAGACT CTATTGGAAA GATAAATCTG GAGATCGTCT ACGCCAGTGG 960 
K T Q E ARL YWK DKSG DRL R Q W 

CTTGGAGTTG ATGAAGAGAC ATTTTACCAT TCTGGAAAAT TTGCTGTTTT ACCTTTAGAT 10 20 
LGVD EET FYH SGKF AVL P L D 

TTTTATTACC CAGGCAAAGG AAAATCAGGA GATTTACCCC CTAGAAAAGG TTTTGCGGAG 1080 
FYYP GKG KSG DLPP RKG FAE 

AAATGGCACC CTCTTATTTT AAAAGAAATG CCTAATGTTC AATTGACCTT GCTAGTTGGT 1140 
K W H P L I I. KEM PNVQ LTL LVG 

CAGTATGCTC AGAAATATTA TCTTGGAAGC TCCGCACATA AAAATCTAAC AGAAACAGTT 1200 
UYAQ KYY LGS SAHK N L T ETV 

AAAGCTTACA AAGACTATCT ACCCGATTAT TTACCCCTGG TTCACCCATC ACCGCGAAAT 1260 
KAYK DYL PDY LPLV HPS PRN 

CAAATTTGGC TAAAGAAGAA TCCATGGTTT GAAAAAGATC TAATCGTTGA TTTACAAAAG 1320 
QIWL KKN PWF EKDL IVD LQK 

ATAGTAGCAG ATATTTTAAA AGATTAAGGA TAGGAGTTGG TATGAGAGAT AATCATCTAC 1380 
ivAD ILK D. MRDNHLH 

ACACGTATTT TTCCTATGAT TGTCAAACGG CATTTGAGGA CTATATTAAT GGTTTTACAG 1440 
SYD CQTA FED YIN GFTG 

GTGAATTTAT CACGACAGAA CATTTTGATT TATCAAATCC TTACACCGGT CAAGACGATG 1500 
TTE HFDL SNP Y TG Q DDV 

TTCCTGATTA TAGTGCTTAT TGTCAAAAAA TAGATTATCT TAATCAGAAA TATGGAAATC 1560 
SAY CQKI DYL NQK YGNR 

GATTTAAAAA AGGAATTGAA ATCGGTTATT TTAAAGATAG GGAATCAGAT ATTTTAGATT 1620 
GIE IGYF KDR ESD ILDY 

ATTTAAAAAA TAAAGAATTT GATTTAAAAC TATTGTCAAT CCATCATAAT GGTAGGTATG 1680 
KEF DLKL LSI HHN GRYD 

ATTATCTGCA AGAAGAAGCT CTGAAAGTAC CAACAAAGGG AGCTTTTAGC AGATTACTTT 1740 
YLQ EEA LKVP TKG AFS RLL. 

AATCGTATGG AATTTGCCAT AGGCCGTGTG GAAGCGCACG TTTTAGCTCA CTTTGATTAT 1800 

GGTTTTCGTA AGTTAAACTT AGATGTAGAA GATTTAAAAC CGTTTGAAAC GCAATTGAAG 1860 

CGCATTTTCA TAAAGATGTT ATCTAAGGGG TTAGCTTTTG AACTAAATAC CAAATCCCTT 1920 
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TATCTATATG GGAATGAAAA ACTTTATCGC TATGCTTTAG AGATACTCAA ACAGCTTGGT 1980 

TGTAAACAAT ACTCTATAGG CTCTGACGGT CATATTCCTG AACATTTTTG TTATGAATTT 2040 

GATAGACTTC AAGGTCTGCT AAAGGACTAT CAAATTGATG AAAATCATTT GATATGAGGA 2100 

AATTTTTGAT AAAAAAGCTA GGCAATATTG CTTAGCTTTT TTGTAATGCT ATTGATAGTT 2160 

TTAGTGAAAA TTTCAAAAAA ATAAAGAAAT CATTTACTTG TTGCAAGCGC TTGCGTAAAT 2220 

TGTTATGATT TTATTGGTAA CAATTCATTA AAAAAGGAGA ATGATATGAA AAGAAAAGAC 2280 

M K R K D 

TTATTTGGTG ATAAACAAAC TCAATACACG ATTAGAAAGT TAAGTG^Gg'aGTAGCTTCA 2340 

GTTACAACAG GGGTATGTAT TTTTCTTCAT AGTCCACAGG TATTTGCTGA AGAAGTAAGT 2400 

GTTTCTCCTG CAACTACAGC GATTGCAGAG TCGAATATTA ATCAGGTTGA CAACCAACAA 24 60 
vsfA TTA IAE SNIN QVD NQQ 

TCTACTAATT TAAAAGATGA CATAAACTCA AACTCTGAGA CGGTTGTGAC ACCCTCAGAT 2520 
cvuu INS NSET VVT PSD 

ATGCCGGATA CCAAGCAATT AGTATCAGAT GAAACTGACA CTCAAJAGGG AGTGACAGAG 2580 

CCGGATAAGG CGACAAGCCT GCTTGAAGAA AATAAAGGTC CTGTTTCAGA TAAAAATACC 2640 

TTAGATTTAA AAGTAGCACC ATCTACATTG CAAAATACTC CCGACAAAAC TTCTCAAGCT 2700 
A * STL QNTP DKT S Q A 

ATAGGTGCTC CAAGCCCTAC CTTGAAAGTA GCTAATCAAG CTCCACGGAT TGAAAATGGT 2760 
iPT LKV A N Q A PRI ENG 

TACTTTAGGC TACATCTTAA AGAATTGCCT CAAGGTCATC CTGTAGARAG CACTGGACTT 2820 

TGGATATGGG GAGATGTTGA TCAACCGTCT AGTAATTGGC CAAATGGTGG TATCCCTATG 2880 

ACTGATGCTA AGAAAGATGA TTACGGTTAT TATGTTGATT TTAAATTATC TGAAAAACAA 2940 

CGAAAACAAA TATCTTTTTT AATTAATAAC AAAGCAGGGA CAAATTTAAG CGGCGATCAT 3000 

CATATTCCAT TATTACGACC TGAGATGAAC CAAGTTTGGA TTGATGAAAA GTACGGTATA 3060 

bMN QVWI DEK YGI 

CATACTTATC AACCCCTCAA AGAAGGGTAT GTCCGTATTA ACTATTTGAG TTCCTCTAGT 3120 

AACTATGACC ACTTATCAGC ATGGCTCTTT AAAGATGTTG CAACCCCYTC AACAACTTGG 3180 
u o i\ w L c KDVA TPS TTW 

CCAGATGGTA GTAATTTTGT GAATCAAGGA CTATATGGAA GGTATATTGA TGTATCACTA 3240 
« r v NQG LYGR YID VSL 
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AAAACTAACG CCAAAGAGAT TGGTTTTCTA ATCTTAGATG AAAGTAAGAC AGGAGATGCA 3300 
KTNA KEI GFL I L 0 E SKT GDA 

GTGAAAGTTC AACCCAACGA CTATGTTTTT AGAGATTTAG CTAACCATAA CCAAATTTTT 3360 

GTAAAAGATA AGGATCCAAA GGTTTATAAT AATCCTTATT ACATTGATCA AGTGCAGCTA 3420 

AAGGATGCCC AACAAATTGA TTTAACAAGT ATTCAAGCAA GTTTTACAAC TCTAGATGGG 3480 
KDAQ QID LTS IQAS F T T LDG 

GTAGATAAAA CTGAAATTTT AAAAGAATTG AAAGTGACTG ATAAAAATCA AAATGCTATA 3540 
VDKT E I L KEL KVTD KNQ NAI 

CAAATTTCTG ATATCACTCT CGATACTAGT AAATCTCTTT TAATAATCAA AGGCGACTTT 3600 
Q I S D I T L DTS KSLL IIK GDF 

AATCCTAAAC AAGGTCATTT CAACATATCT TATAATGGTA ACAATGTCAT GACAAGGCAA 3660 
NPKQ GHF NIS YNGN NVM TRQ 

TCTTGGGAAT TTAAAGACCA ACTTTATGCT TATAGTGGAA ATTTAGGTGC AGTTCTCAAT 3720 
SWEFKDQLYAYSGNLGAVLN 

CAAGATGGTT CAAAAGTTGA AGCCAGCCTC TGGTCACCGA GTGCTGATAG TGTCACTATG 3780 
QDGS KVE ASL WSPS ADS VTM 

ATTATTTATG ACAAAGATAA CCAAAACAGG GTTGTAGCGA CTACCCCCCT TGTGAAAAAT 3840 
I I Y D KDN Q N R VVAT TPL VKN 

AATAAAGGTG TTTGGCAGAC GATACTTGAT ACTAAATTAG GTATTAAAAA CTATACTGGT 3900 
N K G V WQT ILD TKLG IKN YTG 

TACTATTATC TTTACGAAAT AAAAAGAGGT AAGGATAAGG TTAAGATTTT AGATCCTTAT 3960 
YYYL YEI KRG KDKV KIL DPY 

GCAAAGTCAT TAGCAGAGTG GGATAGTAAT ACTGTTAATG ATGATATTAA AACGGCTAAA 4020 
AKSL AEW DSN TVND DIK TAK 

GCAGCTTTTG TAAATCCAAG TCAACTTGGA CCTCAAAATT TAAGTTTTGC TAAAATTGCT 4080 
AAFV NPS QLG PQNL SFA KIA 

AATTTTAAAG GAAGACAAGA TGCTGTTATA TACGAAGCAC ATGTAAGAGA CTTCACTTCT 4140 
RQD AVI YEAH VRD FTS 

GATCGATCTT TGGATGGAAA ATTAAAAAAT CAATTTGGTA CCTTTGCAGC CTTTTCAGAG 4200 
URSL DGK LKN QFGT F A A FSE 

AAACTAGATT ATTTACAGAA ATTAGGAGTT ACACACATTC AGCTTTTACC GGTATTGAGT 4260 
KLDY LQK LGV TH IQ LLP VLS 

TATTTTTATG TTAATGAAAT GGATAAGTCA CGCTCAACAG CTTACACTTC CTCAGACAAT 4320 
YFYV NEM DKS RSTA YTS SDN 

AATTACAATT GGGGCTATGA CCCACAGAGC TATTTTGCTC TTTCTGGGAT GTATTCAGAG 4380 
NYNW GYD PQS YFAL SGM YSE 

AAACCAAAAG ATCCATCAGC ACGTATCGCC GAATTAAAAC AATTAATACA TGATATTCAT 4440 
PSA RIA ELKQ LIH DIH 
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AAACGTGGCA TGGGGGTTAT ACTTGATGTC GTCTATAATC ACACTGCAAA AACTTATCTC 4500 
KRGM GVI LDV V Y N H TAK TYL 

TTTGAGGATA TAGAACCTAA TTATTATCAC TTTATGAATG AAGATGGTTC ACCAAGAGAA 4560 
F E D I EPN YYH FMNE DGS P R E 

AGTTTTGGAG GGGGACGTTT AGGAACCACT CATGCAATGA GTCGTCGTGT TTTGGTTGAT 4630 
SFGG GRL GTT HAMS RRV LVD 

TCCATTAAAT ATCTTACAAG TGAATTTAAA GTTGATGGTT TCCGTTTTGA TATGATGGGA 4680 
SIKY LTS EFK VDGF RFD MMG 

GATCATGATG CGGCTGCGAT TGAATTAGCT TATAAAGAAG CTAAAGCTAT TAATCCTAAT 4740 
DHDA A A I ELA YKEA K A I NPN 

ATGATTATGA TTGGTGAGGG CTGGAGAACA TTCCAAGGCG ATCAAGGTCA GCCGGTTAAA 4800 
M I M I GEG WRT FQGD QGQ PVK 

CCAGCTGACC AAGATTGGAT GAAGTCAACC GATACAGTTG GCGTCTTTTC AGATGATATT 4860 
PADQ DWM KST DTVG VFS DDI 

CGTAATAGCT TGAAATCTGG TTTTCCAAAT GAAGGTACTC CAGCTTTCAT CACAGGTGGC 4920 
RNSL KSG FPN EGTP AFI TGG 

CCACAATCTT TACAAGGTAT TTTTAAAAAT ATCAAAGCAC AACCTGGGAA TTTTGAAGCA 4 980 
PQSL QGI FKN IKAQ PGN FEA 

GATTCGCCAG GAGATGTGGT GCAGTATATT GCTGCACATG ATAACCTTAC CTTGCATGAT 5040 
DSPG DVV QYI AAHD NLT LHD 

GTGATTGCAA AATCAATT (SEQ ID NO: 22) cnco 
V I A K SI. 5058 



FIG. 4a 



NLKAELSVED EQYTATVYGK SAHGSTPQEG VNGATYLALY LSQFDFEGPA 50 

RAFLDVTANI IHEDFSGEKL GVAYEDDCMG PLSMNAGVFQ FDETNDDNTI 100 

ALNFRYPQGT DAKTIQTKLE KLNGVEKVTL SDHEHTPHYV PMDDELVSTL 150 

LAVYEKQTGL KGHEQVIGGG TFGRLLERGV AYGAMFPGDE NTMHQANEYM 200 

PLENIFRSAA IYAEAIYELI K (SEQ ID NO: 23) 221 

FIG. 4b 



MTDLEKIIKA IKSDSQNQNY TENGIDPLFA APKTARINIV GQAPGLKTQE 50 

ARLYWKDKSG DRLRQWLGVD EETFYHSGKF AVLPLDFYYP GKGKSGDLPP 100 

RKGFAEKWHP LILKEMPNVQ LTLLVGQYAQ KYYLGSSAHK NLTETVKAYK 150 

DYLPDYLPLV HPSPRNQIWL KKNPWFEKDL IVDLQKIVAD ILKD 194 
(SEQ ID NO:24) 

FIG. 4c 
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MRDNHLHTYF SYDCQTAFED YINGFTGEFI TTEHFDLSNP YTGQDDVPDY 
SAYCQKIDYL NQKYGNRFKK GIEIGYFKDR ESDILDYLKN KEFDLKLLSI 
HHNGRYDYLQ EEALKVPTKG AFSRLL (SEQ ID NO: 25) 

FIG. 4d 



MKRKDLFGDK QTQYTIRKLS VGVASVTTGV CIFLHSPQVF AEEVSVSPAT 50 

TAIAESNINQ VDNQQSTNLK DDINSNSETV VTPSDMPDTK QLVSDETDTQ 100 

KGVTEPDKAT SLLEENKGPV SDKNTLDLKV- APSTLQNTPD KTSQAIGAPS 150 

PTLKVANQAP RIENGYFRLH LKELPQGHPV ESTGLWIWGD VDQPSSNWPN 200 

GAIPMTDAKK DDYGYYVDFK LSEKQRKQIS FLINNKAGTN LSGDHHIPLL 250 

RPEMNQVWID EKYGIHTYQP. LKEGYVRINY LSSSSNYDHL SAWLFKDVAT 300 

PSTTWPDGSN FVNQGLYGRY I DVSLKTNAK EIGFLILDES KTGDAVKVQP 350 

NDYVFRDLAN HNQIFVKDKD PKVYNNPYYI DQVQLKDAQQ IDLTSIQASF 400 

TTLDGVDKTE ILKELKVTDK NQNAIQISDI TLDTSKSLLI IKGDFNPKQG 450 

HFNISYNGNN VMTRQSWEFK DQLYAYSGNL GAVLNQDGSK VEASLWSPSA 500 

DSVTMIIYDK DNQNRWATT PLVKNNKGVW QTILDTKLGI KNYTGYYYLY 550 

EIKRGKDKVK ILDPYAKSLA EWDSNTVNDD IKTAKAAFVN PSQLGPQNLS 600 

FAKIANFKGR QDAVIYEAHV RDFTSDRSLD GKLKNQFGTF AAFSEKLDYL 650 

QKLGVTHIQL LPVLSYFYVN EMDKSRSTAY TSSDNNYNWG YDPQSYFALS 700 

GMYSEKPKDP SARIAELKQL IHDIHKRGMG VILDWYNHT AKTYLFEDIE 750 

PNYYHFMNED GSPRESFGGG RLGTTHAMSR RVLVDSIKYL TSEFKVDGFR 800 

FDMMGDHDAA AIELAYKEAK AINPNMIMIG EGWRTFQGDQ GQPVKPADQD 850 

WMKSTDTVGV FSDDIRNSLK SGFPNEGTPA FITGGPQSLQ GIFKNIKAQP 900 

GNFEADSPGD WQYIAAHDN LTLHDVIAKS I (SEQ ID NO:26) 931 

FIG. 4e 
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AATTCAAAGT TTGACAGAAG GTCAACTTCG TTCTGATATC CCTGAGTTCC GTGCTGGTGA 60 

-._:> LTEGQLRSDI PEFRAGD 
TACTGTACGT GTTCACGCTA AAGTTGTTGA AGGTACTCGC GAACGTATTC AGATCTTTGA 120 

AGGTGTTGTT ATCTCACGTA AAGGTCAAGG AATCTCAGAA ATGTACACAG TACGTAAAAT 180 
ibRK GQG ISE M Y T V RKI 

TTCTGGTGGT ATCGGTGTAG AGCGTACATT CCCAATTCAC ACTCCTCGTG TTGATAAAAT 240 

CGAAGTTGTT CGTTATGGTA AAGTACGTCG TGCTAAACTT TACTACTTAC GCGCATTGCA 300 
EVV RYGK V R R A K L YYLR ALQ 

AGGTAAAGCT GCACGTATTA AAGAAATCCG TCGTTAATTT TGATGATCAG ATTTTAAAAA 360 

TGCTTGGTTG TTTGAGGATA GTAACTATGT TTTAAAACTG GACAACCAAG ACGTAAAAAA 420 

TCTGCCTGTG GGCAGTTTTT TTACTAGGTC CCCTTAGTTC AATGGATATA ACAACTCCCT 480 

. H I Y C S G 

CCTAAGGAGT AATTGCTGGT TCGATTCCGG CAGGGGACAT ATTCATTGCA TGTAAATAGC 540 
YNST RNR CPV YENC TFL 

GGTTTAGAGC TATTTTGCCC CAAATTTCTC TGATTAAGTT TATCGTTCCT ATCTTTTTGT 600 
PKS SNQG LNR QNL KDNR DKQ 

TCTTGTAATT GATGTGCGTA AACTTCTAAA GTGATATTTA AATTCTCGTG ATCTAAAACT 660 
EQL QHAY VEL TIN LNEH DLV 

TGAGAGATGG AAATTAGATA GCTTGCAAAT GTATGCCTGA GAGAGTGCAC TCGTACCTCG 720 
UJ)I SILY SAF THR LSHV RVE 

C o AC £ AG I TA " T 3 TCGGAT AGTTTTATTG ACTGCATTAT TTGAAAGTTT GTCGAATAAT 780 
IKRI TKN VAN NSLK DFL 

CTGTCGTTTT TATTTTTTGT AAATTCATGC AAAAAAAATA ATGTATCATT GTCAATTGGT 840 
KNKT FEH LFF L.T. DN DIP 

ATATTTCTGA TACTACTTTT GTTTTTTGTT GGCAGGTATC TTTGGTTGAA ATGATAATCC 900 
ISSK NKT PLY RQNF HYD 

CAAGTTTTAT TAATTGATAA ATATTTGTTA GTGTAATCAA TATCATTAAC TGTTAAACCT 960 
NISL YKN TYD IDNV TLG 

AAACATTCAG CGAAGCGCAT GCCAGTTTTA GCGATGAGGT ATAACGCTGC ATACGATTGA 1020 

< 1 

TGTTGTGATT TTTCTTTACA AATTTTTATC AAGCGTAAGT ATTCATTGGT TTCAAGAAAT 1080 

TTTATCTCTA TTTACGCCCC TTATTTTTTG CTTTAACCTT AGTGAATAAA CAAAAATTTT 1140 

TTTCTATATA TCCCTCGTGA ACAGCCATGG ATACGCAGGC TTTTACATGT ATGTTAAAAC 1200 

GCTTTACTGT ATCTTGCACA TGCGTTTGAC TATAATGATT TATGACTTGT TGATATTTAG 1260 
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TGGAAGTAAT ATTGCAAAGT AATATATTTC CTATTATATG TTTATACGAT ATTCGATATT 1320 

CCCACCCGTT GTCGCGTTTA CGGAAATACG CCATTGATAT ACTCCACATT AGCTAAAGAA 1380 

CAGGGTGTTC AAGGCTACCT TGATGGAAAA GGCTCTCTTA GAGATATTTG TAAATGGTAT 1440 

GATATCTCAA GTCGCTCTGT TCTCCAAAAG TGGATAAAAC GGTATACTAG TGGTGAAGAC 1500 

TTGAAAGCCA CTAGTAGAGG ATATAGCCGT ATGAAACAAG GAAGGCAAGC CACATTTGAA 1560 

GAACGTGTAG AGATTGTTAA CTACACCATT GCCCATGGGA AAGACTATCA AGCAGCTATT 1620 

GAGAAGTTTG GTGTTTCCTA CCAACAAATT TATTCTTGGG TGCGTAAGCT TGAGAAGAAT 1680 

GGCTCACAAG GTTTGGTTGA TAGACGTGTG AAAGGGTTGG AGAGTAGGCC TGATTTAACC 1740 

GAGATTGAGC AACTTTAACT CAAGATTAAA CAATTGGAGG AACGTAATCG TCTCTTAGAA 1800 

ATCGAGGTTA GTTTACTAAA AAAGTTAGAA GACATCAAAC GAGGAAACAG ACGGTAAGAC 1860 

TAGGTAAGCA TTTAGCGGAG TTCCAAGTAA TCAAGAATTA TTACGATGAG GAATCTAATG 1920 

TGCCTATTCA GGCCTTATGC CAACTCTTGA AGGGGTCTCG TTCAGGCTAT TACAAGTGGC 1980 

TCAATCGTCA AAAAACAGAT TTTGAGACAA AAAATACAAA GCTAATGGCT AAAATCAAGG 2040 

AACTTCGTAG ACTCTACAAT GGTATCTTAG GTTATCGCCG TATGACAACA TTTATTAATC 2100 

GTCAACTTGG GACAACTTAA AACAAGAAAC GGATTCGTTG ATTGATGAAC ATTCTGGGGA 2160 

TTAGTTCAGT CATTCGTCGT GTTAGCCATG CTTGTACAAA AGCTGGTGAC AGATTTTACG 2220 

AAGAAAATAT TCTTAATCGT GAATTTACAG CCACAGCTCA TAACCAGAAA TGGTGCACAG 2280 

ATGTCACCTA TCTTCAATAC GGTCTGGGAG CTAAAGCTTA TCTCAGTGCG ATTAAAGACC 2340 

TGTATAACGG TTCTATTATC GCTTATGAGA TTAGTCACAA CAATGAAATC CACTTGTTAT 2400 

GAAGACCATT AAAAAGGGGC TAGAGCTCAA TCCAGGAGCC ACACCTATCA TCCATAGCGA 2460 

TTGAGGTAGT CAATATACTT CCAAAGAATA CCGTTATATC ATACAACAAG CTGGTCTGAC 2520 

CTTATCCATG TCCCGGATTG GCAAATGTAT TGATAATGCA CCAACTGAAA GTTTCTTTGG 2580 

GTTTTTCAAG ACTGAGTCTT ACCACCTTAA GAAATACAAC TCTTATGATG AGTTGGTCAA 2640 

TGATGTGGCA CGTTATATCG AATTCTACAA CACACAACGT TATCAATCAA AATTAAACAA 2700 

CCTGACTCCT CTAGAATTCA GGAATCAGGT TGCATAACTT ATCTTTTATT ATTTGACTGT 2760 

CTACTTGACA GGGAGCCGTT CAGATTGCTT AACCTTTCTA AATTTGCTAA AATAGCTACA 2820 

AGAAAACGAG CCATTTAATG CTTATTTCTT ATACTGTCTT GCCTCACGCT CTCCTGGACC 2880 

AAAAATTGAG CGTGAGGCTT TTTGTTTCAT TAAACGATGA TATTTCCATA TTCATCAGTT 2940 

TGTTTTCCGA GAGCCATCAA AGCTTCGATA AGGTCGATAA TTCCAGGAAT AAAGGTAATA 3000 

CTAAAAATAA TATATAAAAA AACCTGGCCT ATTTTTCCTG CGTAAAATTT ATGCGCTCCA 3060 

ATGCCGCCCA AAAGAACGTT AATAAAACAT AAACTACTAT GTTAGCATAA GACTTTATTT 3120 
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TTACAACTGA ATTTCATATA AATGGATTAG AGTAAGGGAT AAAAGAAATT AGCATAGCTC 3180 

TTTTGAAAAT AAAAAAATTA ATATAATATG GAAAAAATTT TATTTCATAA ACGTTTCATA 3240 

AAAGGTATGT AATCTAGTAT TTAGGCAACA CTATTTTGTC ACTGGTGTCT AGTAACTTAT 3300 

AGATTGATAA TTTTACTAGT AAACGTAATT CTTCGCTTTA AGAGTTAAAT GTCTATTTAT 3360 

TGTAAGCTAA ATTGGGAGGT GAACTTATGT AAAATTAGAT AGGTACTGTC AAGTACGGGA 3420 

TGATTATTGA AACAGCCAGT ATGCATCATA AAATCTGTAT TGCTTAATAA CTATTTCCTT 3480 

AACCAGACAT CAGTTCATTG TTTATCATCG CTACCCTAAG TCTAGTTTTT TCAATAGAGC 3540 

ATTAGGTAGT TTTTGATAAT AAAACTATAT AAACATGAGA ATTAGATTTC GTATTGCATT 3600 

CTTCATAATG AGTTATTTGA GATTTTCCTT TGAATAAATA GATACGAAAT TCAGTAACTT 3660 

CATATATAAA CGGCTCTATC ATTGAGATAG TTTGTCAAAT GAAGAAATTT TTAATGGAAA 3720 

TAGTTTTAAA AACATTAGTT GTAGGCGATG TAAAAATATT AATCCAGTGG ATGCAATAGT 3780 

TGCGGAGTAA AAATAGAGAG GAGTAATTAG GAAGTGATAA AAAATGCTAT AGCATATATT 3840 

ACCAGAAAAA AAAATAGAAC ACTTATTATA TTTGCTATTT TAACAATTGT TCTTTCTTGC 3900 

TTGTATTCAT GTTTAACAAT AATGAAATCA AGTAATGAAA TAGAAAAGGC TTTATATGAA 3960 

MK^S S N E I E K A LYE 

AGTTCTAATT CTTCAATATC AATTACAAAA AAAGATGGTA AATATTTTAA TATTAATCAA 4020 
SSNS SIS ITK KDGK Y F N INQ 

TTTAAGAATA TTGAAAAAAT AAAAGAGGTT GAAGAAAAAA TATTTCAATA TGATGGATTA 4080 
EKI KEV EEKI F Q Y D G L 

GCAAAATTGA AAGATCTTAA AGTAGTTAGT GGTGAGCAAA GTATAAATAG AGAAGATTTA 4140 
DLK VVS GEQS I N R EDL 

TCTGACGAAT TTAAAAATGT TGTTTCACTA GAAGCTACAA GTAATACTAA AAGAAATCTT 4200 
SDEF KNV VSL EATS NTK RNL 

TTATTTAGTA GTGGAGTATT TAGTTTTAAA GAAGGAAAAA ATATAGAAGA AAATGATAAG 4260 
LFSS GVF SFK EGKN IEE NDK 

AATTCAATTC TTGTTCATGA AGAATTTGCT AAACAAAACA AACTAAAATT GGGTGATGAA 4320 
NSIL VHE EFA KQNK L K L GDE 

ATTGATCTTG AATTACTAGA TACGGAAAAA AGTGGAAAAA TAAAAAGTCA TAAATTTAAA 4380 
IDLE LLD TEK SGKI K SH KF K 

ATTATAGGAA TCTTTTCTGG TAAAAAACAG GAAACATATA CAGGATTATC ATCTGATTTT 4440 
I I G 1 FSG KKQ ETYT GLS SDF 

AGCGAAAATA TGGTTTTTGT AGATTATTCA ACTAGCCAAG AAATATTAAA TAAATCAGAG 4500 
VFV DYS TSQE ILN KSE 

AATAATAGAA TTGCAAATAA AATTTTAATG TATTCTGGTA GTTTAGAATC TACAGAGCTT 4560 
ANK ILM YSGS LES TEL 

GCCTTAAACA AATTGAAAGA CTTTAAAATT GATAAGTCAA AGTATTCTAT TAAGAAAGAT 4620 
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ALNK LKD F K I DKSK Y S I KKD 

AATAAAGCAT TCGAAGAGTC TTTAGAGTCA GTGAGTGGAA TAAAACATAT AATTAAAATA 4680 
NKAF EES LES VSGI KHI I K I 

ATGACTTATT CGATTATGTT AGGTGGAATA GTTGTTCTTT CATTAATCTT GATTCTATGG 4 740 
MTYS IMLGGIVVLSLILILW 

TTAAGAGAAA GAATTTATGA AATAGGTATA TTTTTATCTA TTGGAACAAC TAAGATACAA 4 8 On 
LRER IYE IGI F L S I GTT KIQ 

ATTATAAGGC AATTTATATT TGAGTTAATA TTCATATCAA TACCAAGTAT AATATCCTCC 4860 
I I R Q FIF ELI FISI PSI ISS 

TTATTTTTAG GGAATCTACT ATTAAAAGTA ATTGTAGAAG GATTTATTAA CTCAGAGAAC 4920 
LFLG NLL LKV IVEG FIN SEN 

TCAATGATTT TCGGTGGAAG TTTAATAAAT AAAAGCAGTT TTATGTTAAA CATAACAACA 4 980 
SMIF GGS LIN KSSF MLN ITT 

CTTGCAGAAA GTTATTTAAT ATTAATAAGT ATTATTGTTT TATCAGTTGT AATGGCCTCT 5040 
L A E S YLI LIS I I V L SVV MAS 

TCATTAATAT TATTTAAGAA ACCACAAGAA ATATTATCAA AAATAAGTTA GGAGCAAATA 5100 

S L I L F K K P Q E I L S K IS. 

ATGGATATAT TAGAAATAAA GAATGTAAAT TACAGTTACG CAAATTCTAA AGAAAAAGTT 5160 

MDIL EIK NVN YSYA NSK EKV 

| > 

TTGTCAGGAG TAAATCAAAA ATTTGAACTT GGAAAGTTTT ATGCGATAGT AGGGAAGTCA 5220 

L S G V NQK FEL GKFY AIV GKS 

GGAACAGGAA AATCCACACT TCTTTCCTTA CTTGCAGGAC TTGATAAAGT TCAAACAGGA 5280 
GTGK STL LSL LAGL DKV QTG 

AAAATCTTGT TTAAGAATGA AGATATAGAA AAGAAAGGAT ATAGTAATCA CAGAAAAAAT 5340 
K I L F KNE DIE KKGY SNH RKN 

AATATATCTT TGGTATTTCA AAATTATAAT TTAATAGATT ATTTATCGCC GATTGAAAAT 5400 
NISL VFQ NYN LIDY LSP IEN 

ATTAGACTAG TAAATAAATC AGTAGATGAG AGTATCTTGT TCGAATTAGG TTTAGATAAA 5460 
I R L V NKS VDE SILF ELG LDK 

AAACAAATAA AAAGAAATGT TATGAAATTA TCTGGTGGTC AGCAACAAAG GGTAGCTATT 5520 
KQIK RNV MKL SGGQ QQR VAI 

GCTAGGGCAC TGGTATCAGA TGCCCCAATA ATACTAGCTG ATGAGCCTAC CGGTAACCTA 5580 
ARAL VSD API ILAD EPT GNL 

GACAGTGTTA CTGCTGGAGA AATAATT (SEQ ID NO: 27^ 5607 
DSVT AGE II. 



FIG. 5a 
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IQSLTEGQLR 
ISEMYTVRKI 
GKAARIKEIR 

FIG. 5b 
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MKSSNEIEKA 


LYESSNSSIS 


ITKKDGKYFN 


INQFKNIEKI 


KEVEEKIFQY 


50 


DGLAKLKDLK 


VVSGEQSINR 


EDLSDEFKNV 


VSLEATSNTK 


RNLLFSSGVF 


100 


SFKEGKNIEE 


NDKNSILVHE 


EFAKQNKLKL 


GDEIDLELLD 


TEKSGKIKSH 


150 


KFKIIGIFSG 


KKQETYTGLS 


SDFSENMVFV 


DYSTSQEILN 


KSENNRIANK 


200 


ILMYSGSLES 


TELALNKLKD 


FKIDKSKYSI 


KKDNKAFEES 


LESVSGIKHI 


250 


IKIMTYSIML 


GGIVVLSLIL 


ILWLRERIYE 


IGIFLSIGTT 


KIQIIRQFIF 


300 


ELIFISIPSI 


ISSLFLGNLL 


LKVIVEGFIN 


SENSMIFGGS 


LINKSSFMLN 


350 


ITTLAESYLI 


LISIIVLSVV 


MASSLILFKK 


PQEILSKIS 




389 


(SEQ ID NO:30) 











FIG. 5d 

MDILEIKNVN YSYANSKEKV LSGVNQKFEL GKFYAIVGKS GTGKSTLLSL 50 

LAGLDKVQTG KILFKNEDIE KKGYSNHRKN NISLVFQNYN LIDYLSPIEN 100 

IRLVNKSVDE SILFELGLDK KQIKRNVMKL SGGQQQRVAI ARALVSDAPI 150 

ILADEPTGNL DSVTAGEII (SEQ ID NO: 31) 169 

FIG . 5e 



SDIPEFRAGD TVRVHAKVVE GTRERIQIFE GWISRKGQG 50 
SGGIGVERTF PIHTPRVDKI EVVRYGKVRR AKLYYLRALQ 100 
R (SEQ ID NO:28) 1in 
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CATATGACAA TATTTTTCAA AGTCTACATC ACTTACTCGC CTGTCGTGGA AAATCTGGCA 60 
ATACATTAAT CGACCAATTA GTTGCTGATG GTTTACTTCA TGCAGATAAT CACTACCATT 120 
TTTTCAATGG GAAGTCTCTG GCCACTTTCA ATACTAACCA ATTGATTCGC GAAGTTGTCT 180 
ATGTTGAAAT ATCCTTAGAT ACTATGTCTA GTGGTGAACA TGATTTAGTA AAAGTTAACA 240 
TTATCAGACC CACTACCGAG CATACTATCC CCACGATGAT GACAGCTAGC CCCTATCATC 300 
AAGGTATCAA TGATCCTGCC GCAGACCAAA AAACATACCA AATGGAGGGT GCGCTAGCAG 360 
TTAAACAGCC TAAACACATA CAAGTTGACA CAAAACCATT TAAAGAAGAA GTAAAACATC 420 
CTTCAAAATT ACCCATCAGC CCTGCAACTG AAAGCTTCAC ACACATTGAC AGTTATAGTC 480 
TCAATGACTA TTTTCTTTCT CGTGGTTTTG CTAATATATA CGTTTCAGGT GTGGGTACTG 540 
CTGGCTCTAC GGGTTTCATG ACCAGTGGGG ATTACCAACA AATACAAAGC TTTAAAGCAG 600 
TCATTGATTG GTTAAATGGT AAGGTTACTG CATTCACAAG TCATAAACGA GATAAACAAG 660 
TCAAGGCTGA TTGGTCAAAC GGCCTTGTAG CAACCACAGG TAAATCTTAT CTCGGTACCA 720 
TGTCAACTGG TTTAGCAACA ACTGGCGTTG AGGGGCTGAA AGTCATTATC GCTGAAGCCG 780 
CAATCTCCAC ATGGTATGAT TATTATCGAG AAAATGGGCT TGTGTGTAGT CCAGGCGGCT 840 
ACCCCGGTGA AGATTTAGAC GTTTTAACAG AATTAACATA CTCACGAAAC CTCTTAGCTG 900 
GTGATTACAT CAAAAACAAC GATTGCTATC AAGCATTGTT AAATGAACAA TCAAAAGCAA 960 
TTGACCGTCA AAGTGGGGAT TACAACCAAT ACTGGCATGA CCGTAATTAC CTAACTCACG 1020 
TCAATAATGT CAAAAGTCGA GTAGTTTACA CTCATGGACT ACAGGATTGG AATGTTAAGC 1080 
CAAGACATGT CTACAAAGTT TTCAATGCAT TGCCTCAAAC CATCAAAAAA CACCTTTTTT 1140 
TACATCAAGG TCAACATGTG TATATGCATA ATTGGCAGTC GATTGATTTT CGTGAAAGCA 1200 
TGAATGCCTT ACTAAGCCAA GAACTACTTG GCATTGACAA TCATTTCCAA TTAGAAGAGG 1260 
TCATTTGGCA AGATAATACT ACTGAGCAAA CTTGGCAAGT TTTAGATGCT TTCGGAGGAA 1320 
ACCATCAAGA GCAAATTGGT TTAGGTGATA GTAAAAAACT TATTGATAAC CATTATGACA 1380 
AAGAAGCCTT TGATACTTAT TGTAAAGACT TCAATGTGTT CAAAAATGAT CTTTTCAAGG 1440 
GAAATAATAA AACCAATCAA ATCACTATTA ATCTTCCTCT AAAGAAAAAT TATCTCCTGA 1500 
ATGGACAGTG CAAACTCCAT CTACGTGTTA AAACTAGTGA CAAAAAGGCC ATTTTATCAG 1560 
CCCAAATCTT AGACTATGGT CCTAAAAAAC GATTCAAAGA TACACCAACC ATCAAATTCT 1620 
TAAACAGCCT TGATAATGGT AAAAATTTTG CCAGAGAAGC TTTACGTGAA CTCCCGTTTA 1680 
CTAAAGATCA TTATCGTGTC ATCAGTAAAG GTGTCTTGAA CCTTCAAAAT CGTACAGACT 1740 
TACTTACAAT TGAGGCTATC GAGCCAGAAC AATGGTTTGA TATCGAGTTT AGCCTCCAAC 1800 
CAAGTATATA TCAATTGAGT AAAGGTGATA ATCTAAGGAT TATCCTTTAT ACAACTGATT 1860 
TTGAACATAC CATTCGAGAT AATGCTAGTT ACTCTATAAC AGTAGATTTG AGTCAATCTT 1920 
ATTTAACTAT CCCAACTAAT CAAGGAAATT AACTTATGAA ACTTCTTACT AAAGAACGGT 1980 
TTGATGATTC TCAACACTTT TGGTACCAGA TCAATTTATT ACAAGAGAGT AACTTCGGAG 2040 
CAGTTTTTGA CCATGATAAT AAAAACATTC CACAGGTTGT TGCAACTATT GTTGATGATT 2100 
TACAAGGTTC CGGAAGTTCG AATCATTTCT GGTATTTTGG CAATACTACT GATACTTCCA 2160 
TCCTTATGAT TGCTCATTTA AATCGAAAAT TCTATATTCA GGTTAATTTA AAGGACTTTG 2220 
ACTTTGCACT CAATTTAATA GCTATAAATA ATTGGAAGAG TCTCCTCCAA ACTCAACTTG 2280 
AAGCTCTAAA CGATACCCTA GCAATATTTC AATAAATAAG GTAGAATGGA GTGACAAAGC 2340 
AACGCGAGGG AGACTGATTA ATGTCATCTT ATTGGAATAA CTATCCTGAA CTTAAAAAAA 2400 
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ATATTGATGA AACCAATCAA CTAATTCAAG AAAGAATACA GGTCAGAAAT AAAGATATTG 2460 

AAGCGGCGCT AAGCCAACTC ACAGCTGCGG GAGGAAAACA GCTCAGACCA GCATTCTTTT 2520 

ACCTTTTTTC TCAACTTGGT AATAAGGAGA ATCAAGATAC TCAGCAACTA AAGAAAATCG 2580 

CTGCTTCTTT AGAAATCCTT CACGTTGCTA CATTAATCCA TGATGATGTC ATTGATGACT 2640 

CACCACTAAG ACGTGGAAAT ATGACCATTC AAAGCAAGTT TGGCAAAGAC ATCGCAGTTT 2700 

ATACTGGGGA TTTACTTTTC ACAGTCTTTT TCGATCTTAT TTTAGAATCT ATGACTGATA 2760 

CACCATTTAT GAGGATTAAT GCAAAATCTA TGCGTAAAAT TCTCATGGGA GAATTGGACC 2820 

AGATGCACCT TCGTTACAAT CAACAACAAG GTATCCATCA CTATTTACGT GCGATTTCAG 2880 

GTAAGACAGC CGAACTCTTT AAATTAGCTA GCAAAGAAGG AGCTTACTTT GGTGGTGCAG 2940 

AGAAGGAGGT TGTTCGTCTA GCAGGCCATA TCGGCTTTAA CATTGGTATG ACATTCCAAA 3000 

TTTTGGATGA TATCCTGGAT TATACTGCAG ATAAAAAAAC ATTTAATAAG CCTGTCTTAG 3060 

AGGATTTAAC ACAAGGCGTT TACAGCCTTC CTCTACTTCT TGCCATTGAA GAAAATCCTG 3120 

ATATTTTCAA ACCTATTTTA GATAAAAAAA CAGATATGGC TACTGAAGAC ATGGAAAAAA 3180 

TTGCTTATCT CGTCGTTTCC CATAGAGGTG TTGACAAAGC TCGCCATCTA GCTCGTAAAT 3240 

TTACTGAGAA AGCTATTAGT GACATAAATA AGCTACCCCA GAACTCTGCA AAAAAACAGT 3300 

TGCTACAATT AACTAATTAC CTTTTAAAAC GCAAAATTTA AATAATAAAA AAACATTCCA 3360 

CAATGCTAGA AAAGCAGTTA GGGAATGTTT TTTTATTATC ATTTATTTAT CGCACCTATC 3420 

AATCATCATA GATCACCATC ATCAGCGGCT TTCAGCTGAC GGTAACGTTG ACTACTTTGA 3480 

GACAATTCTT GAGGAGAACC TTCCAACTCT AATTGCCCAT TTTCTATAAA TAAGATACGA 3540 

TCAGCATGTT CAATACCTTT TAAGTGATGT GTAATCCAAA CTAAGGTCTT ACCTTCCAAT 3600 

TCTTTCATAA ATACCCTTAG TAAGGCTTGT TCAGTAATAG GATCAAGTCC AACAGTTGGC 3660 

TCATCTAAGA TAACAATTGG GACATCTTTT AGTAAGATTC TAGCCAAAGC AATTCTATGC 3720 

CTTTCGCCAC CTGAAAACCT AAGTCCAGCT TCATCAACCA TTGTATAGAG ACCATCTGAT 3780 

AAATCAGTGA CCATCTCTTT CAATCCAACT CGTTCAAGAA CTTTCCATAC ATCTTCTTCA 3840 

CTAGCATCTT GGTTTCCAAT GCGAATGTTA TTTAGCAGGG TTGTATTAAA AAGGTAGGGC 3900 

GCTTGTTGTA TCACTCCAAT ATAGTTAGAA ATGCAATCAC CAACTATTGA AACATCAGCA 3960 

CCGCCTAGGG TAATCTTCCC TTGACTTGCT TTCAAGTCGC CACGAAGTAG ACTAGCTAAG 4020 

GTACTCTTGC CAGAACCACT CCGCCCTAAA ATAGCAATTT TTTCTCCTTC TTTAATATCC 4 080 

AAATCTAAAT GATGCAAAAC CCATTTCTCT TGTGGCTTAT ACTGGAAACT TAAATTCTTG 4140 

ACGGAAAAAT CATATGGCTT ATTAGGCAAT T {SEQ ID NO: 32) 4171 

FIG. 6a 
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YDNIFQSLHH LLACRGKSGN TLIDQLVADG 
TNQLIREWY VEISLDTMSS GEHDLVKVNI 
GINDPAADQK TYQMEGALAV KQPKHIQVDT 
SFTHIDSYSL NDYFLSRGFA NIYVSGVGTA 
IDWLNGKVTA FTSHKRDKQV KADWSNGLVA 
GLKVIIAEAA ISTWYDYYRE NGLVCSPGGY 
DYIKNNDCYQ ALLNEQSKAI DRQSGDYNQY 
HGLQDWNVKP RHVYKVFNAL PQTIKKHLFL 
NALLSQELLG IDNHFQLEEV IWQDNTTEQT 
KKLIDNHYDK EAFDTYCKDF NVFKNDLFKG 
GQCKLHLRVK TSDKKAILSA QILDYGPKKR 
REALRELPFT KDHYRVISKG VLNLQNRTDL 
SIYQLSKGDN LRIILYTTDF EHTIRDNASY 
(SEQ ID NO:33) 

FIG. 6 



LLHADNHYHF FNGKSLATFN 50 

IRPTTEHTIP TMMTASPYHQ 100 

KPFKEEVKHP SKLPISPATE 150 

GSTGFMTSGD YQQIQSFKAV 200 

TTGKSYLGTM STGLATTGVE 250 

PGEDLDVLTE LTYSRNLLAG 300 

WHDRNYLTHV NNVKSRWYT 350 

HQGQHVYMHN WQSIDFRESM 400 

WQVLDAFGGN HQEQIGLGDS 450 

NNKTNQITIN LPLKKNYLLN 500 

FKDTPTIKFL NSLDNGKNFA 550 

LTIEAIEPEQ WFDIEFSLQP 600 

SITVDLSQSY LTIPTNQGN 649 



MKLLTKERFD DSQHFWYQIN LLQESNFGAV FDHDNKNIPQ WATIVDDLQ 50 

GSGSSNHFWY FGNTTDTSIL MIAHLNRKFY IQVNLKDFDF ALNLIAINNW 100 

KSLLQTQLEA LNDTLAIFQ (SEQ ID NO: 34) 119 

FIG. 6c 

MSSYWNNYPE LKKNIDETNQ LIQERIQVRN KDIEAALSQL TAAGGKQLRP 50 

AFFYLFSQLG NKENQDTQQL KKIAASLEIL HVATLIHDDV IDDSPLRRGN 100 

MTIQSKFGKD IAVYTGDLLF TVFFDLILES MTDTPFMRIN AKSMRKILMG 150 

ELDQMHLRYN QQQGIHHYLR AISGKTAELF KLASKEGAYF GGAEKEWRL 200 

AGHIGFNIGM TFQILDDILD YTADKKTFNK PVLEDLTQGV YSLPLLLAIE 250 

ENPDIFKPIL DKKTDMATED MEKIAYLWS HRGVDKARHL ARKFTEKAIS 300 

DINKLPQNSA KKQLLQLTNY LLKRKI (SEQ ID NO: 35) 326 

FIG. 6d 
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LPNKPYDFSV KNLSFQYKPQ EKWVLHHLDL DIKEGEKIAI LGRSGSGKST 50 

LASLLRGDLK ASQGKITLGG ADVSIVGDCI SNYIGVIQQA PYLFNTTLLN 100 

NIRIGNQDAS EEDVWKVLER VGLKEMVTDL SDGLYTMVDE AGLRFSGGER 150 

HRIALARILL KDVPIVILDE PTVGLDPITE QALLRVFMKE LEGKTLVWIT 200 

HHLKGIEHAD RILFIENGQL ELEGSPQELS QSSQRYRQLK AADDGDL 247 
(SEQ ID NO:36) 

FIG. 6e 
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AATTCTATTT GGAGGTTTTT CTTGAATAAA TGGTTAGTTA AGGCAAGTTC CTTAGTTGTT 60 

TTAGGTGGTA TGGTTTTATC TGCGGGTTCC CGAGTTTTAG CGGATACTTA TGTCCGTCCA 120 

ATTGATAATG GTAGAATTAC AACAGGTTTC AATGGTTATC CTGGACATTG TGGGGTGGAT 180 

TATGCTGTTC CGACTGGAAC GATTATTAGG GCAGTGGCAG ATGGTACTGT GAAATTTGCA 240 

GGAGCTGGAG CCAACTTTTC TTGGATGACA GACTTAGCAG GAAATTGTGT CATGATTCAA 300 

CATGCGGATG GAATGCATAG TGGTTACGCT CATATGTCAC GTGTGGTGGC TAGGACTGGG 360 

GAAAAAGTCA AACAAGGAGA TATCATCGGT TACGTAGGAG CAACTGGTAT GGCGACGGGA 420 

CCTCACCTTC ATTTTGAATT TTTACCAGCT AACCCTAATT TTCAAAATGG TTTCCATGGA 480 

CGTATCAATC CAACGTCACT AATTGCTAAC GTTGCGACCT TTAGTGGAAA AACGCAAGCA 540 

TCAGCTCCAA GCATTAAGCC ATTACAATCA GCTCCTGTAC AGAATCAATC TAGTAAATTA 600 

AAAGTGTATC GAGTAGATGA ATTACAAAAG GTTAATGGTG TTTGGTTAGT CAAAAATAAC 660 

ACCCTAACGC CGACTGGGTT TGATTGGAAC GATAATGGTA TACCAGCATC AGAAATTGAT 720 

GAGGTTGATG CTAATGGTAA TTTGACAGCT GACCAGGTTC TTCAAAAAGG TGGTTACTTT 780 

ATCTTTAATC CTAAAACTCT TAAGACTGTA GAAAAACCCA TCCAAGGAAC AGCTGGTTTA 840 

ACTTGGGCTA AGACACGCTT TGCTAATGGT AGTTCAGTTT GGCTTCGCGT TGACAACAGT 900 

CAAGAACTGC TTTACAAATA GTTTGAGGTA TTGATTCATT GTTTTAAATG ACAGTTTTGT 960 

TACTAACTAA GTACAATTTC TTTAAACCGT CTGAAAATAA TTTTATAGTC CAGTAAAGTG 1020 

TGATATTATA GTCTCGGACT AATAAAAAGG AAATAGGAAT TGAAGCAATG AAAATGAATA 1080 

AAAAGGTACT ATTGACATCG ACAATGGCAG CTTCGCTATT ATCAGTCGCA AGTGTTCAAG 1140 

CACAAGAAAC AGATACGACG TGGACAGCAC GTACTGTTTC AGAGGTAAAG GCTGATTTGG 1200 

TAAAGCAAGA CAATAAATCA TCATATACTG TGAAATATGG TGATACACTA AGCGTTATTT 1260 

CAGAAGCAAT GTCAATTGAT ATGAATGTCT TAGCAAAAAT TAATAACATT GCAGATATCA 1320 

ATCTTATTTA TCCTGAGACA ACACTGACAG TAACTTACGA TCAGAAGAGT CATACTGCCA 1380 

CTTCAATGAA AATAGAAACA CCAGCAACAA ATGCTGCTGG TCAAACAACA GCTACTGTGG 1440 

ATTTGAAAAC CAATCAAGTT TCTGTTGCAG ACCAAAAAGT TTCTCTCAAT ACAATTTCGG 1500 

AAGGTATGAC ACCAGAAGCA GCAACAACGA TTGTTTCGCC AATGAAGACA TATTCTTCTG 1560 

CGCCAGCTTT GAAATCAAAA GAAGTATTAG CACAAGAGCA AGCTGTTAGT CAAGCAGCAG 1620 

CTAATGAACA GGTATCAACA GCTCCTGTGA AGTCGATTAC TTCAGAAGTT CCAGCAGCTA 1680 

AAGAGGAAGT TAAACCAACT CAGACGTCAG TCAGTCAGTC AACAACAGTA TCACCAGCTT 1740 

CTGTTGCCGC TGAAACACCA GCTCCAGTAG CTAAAGTAGC ACCGGTAAGA ACTGTAGCAG 1800 

CCCCTAGAGT GGCAAGTGTT AAAGTAGTCA CTGCTAAAGT AGAAACTGGT GCATCACCAG 1860 

AGCATGTATC AGCTCCAGCA GTTCCTGTGA CTACGACTTC AACAGCTACA GACAGTAAGT 1920 

TACAAGCGAC TGAAGTTAAG AGCGTTCCGG TAGCACAAAA AGCTCCAACA GCAACACCGG 1980 

TAGCACAACC AGCTTCAACA ACAAATGCAG TAGCTGCACA TCCTGAAAAT GCAGGGCTCC 2040 

AACCTCATGT TGCAGCTTAT AAAGAAAAAG TAGCGTCAAC TTATGGAGTT AATGAATTCA 2100 

GTACATACCG TGCAGGTGAT CCAGGTGATC ATGGTAAAGG TTTAGCAGTC GACTTTATTG 2160 

TAGGTAAAAA CCAAGCACTT GGTAATGAAG TTGCACAGTA CTCTACACAA AATATGGCAG 2220 

CAAATAACAT TTCATATGTT ATCTGGCAAC AAAAGTTTTA CTCAAATACA AATAGTATTT 2280 

ATGGACCTGC TAATACTTGG AATGCAATGC CAGATCGTGG TGGCGTTACT GCCAACCATT 2340 

ATGACCATGT TCACGTATCA TTTAACAAAT AATATAAAAA AGGAAGCTAT TTGGCTTCTT 2400 
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TTTTATATGC CTTGAATAGA CTTTCAAGGT TCTTATCTAA TTTTTATTAA ATTGAGGAGA 2460 

TTAAGCTATA AGTCTGAAAC TACTTTCACG TTAACCGTGA CTAAATCAAA ACGTTAAAAC 2520 

TAAAATCTAA GTCTGTAAAG ATTATTGAAA ACGCTTTAAA AACAGATATA ATAAGGTTTG 2580 

TAGATATCTA AAATTAAAAA AGATAAGGAA GTGAGAATAT GCCACATCTA AGTAAAGAAG 2640 

CTTTTAAAAA GCAAATAAAA AATGGCATTA TTGTGTCATG TCAAGCTTTG CCTGGGGAGC 2700 

CTCTTTATAC TGAAAGTGGA GGTGTTATGC CTCTTTTAGC TTTGGCAGCT CAAGAAGCAG 2760 

GAGCGGTTGG TATAAGAGCC AATAGTGTCC GCGACATTAA GGAAATTCAA GAAGTTACTA 2820 

ATTTACCTAT CATCGGCATT ATTAAACGTG AATATCCTCC ACAAGAACCA TTTATCACTG 2880 

CTACGATGAC AGAGGTGGAT CAATTAGCTA GTTTAGATAT TGCAGTAATA GCCTTAGATT 2940 

GTACACTTAG AGAGCGTCAT GATGGTTTGA GTGTAGCTGA GTTTATTCAA AAGATAAAAG 3000 

GGAAATATCC TGAACAGTTG CTAATGGCTG ATATAAGTAC TTTTGAAGAA GGTAAAAATG 3060 

CTTTTGAAGC AGGAGTTGAT TTTGTGGGTA CAACTCTATC TGGATACACA GATTACAGCC 3120 

GCCAAGAAGA AGGACCGGAT ATAGAACTCC TTAATAAGCT TTGTCAAGCC GGTATAGATG 3180 

TGATTGCGGA AGGTAAAATT CATACTCCTA AGCAAGCTAA TGAAATTAAT CATATAGGTG 3240 

TTGCAGGAAT TGTAGTTGGT GGTGCTATCA CTAGACCAAA AGAAATAGCG GAGCGTTTCA 3300 

TCTCAGGACT TAGTTAAAAG TGTTACTCAA AAATCAAAAT CAAAATAAAA AAGGGGAATA 3360 

GTTATGAGTA TCAAAAAAAG TGTGATTGGT TTTTGCCTCG GAGCTGCAGC ATTATCAATG 3420 

TTTGCTTGTG TAGACAGTAG TCAATCTGTT ATGGCTGCCG AGAAGGATAA AGTCGAAATT 3480 
(SEQ ID NO:37) 

FIG. 7a 



NSIWRFFLNK WLVKASSLVV LGGMVLSAGS RVLADTYVRP IDNGRITTGF 50 

NGYPGHCGVD YAVPTGTIIR AVADGTVKFA GAGANFSWMT DLAGNCVMIQ 100 

HADGMHSGYA HMSRVVARTG EKVKQGDIIG YVGATGMATG PHLHFEFLPA 150 

NPNFQNGFHG RINPTSLIAN VATFSGKTQA SAPSIKPLQS APVQNQSSKL . 200 

KVYRVDELQK VNGVWLVKNN TLTPTGFDWN DNGIPASEID EVDANGNLTA 250 

DQVLQKGGYF IFNPKTLKTV EKPIQGTAGL TWAKTRFANG SSVWLRVDNS 300 

QELLYK (SEQ ID NO:38) 306 

FIG. 7b 
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MKMNKKVLLT STMAASLLSV ASVQAQETDT TWTARTVSEV KADLVKQDNK 50 

SSYTVKYGDT LSVISEAMSI DMNVLAKINN IADINLIYPE TTLTVTYDQK 100 

SHTATSMKIE TPATNAAGQT TATVDLKTNQ VSVADQKVSL NTISEGMTPE 150 

AATTIVSPMK TYSSAPALKS KEVLAQEQAV SQAAANEQVS TAPVKSITSE 200 

VPAAKEEVKP TQTSVSQSTT VSPASVAAET PAPVAKVAPV RTVAAPRVAS 250 

VKWTPKVET GASPEHVSAP AVPVTTTSTA TDSKLQATEV KSVPVAQKAP 300 

TATPVAQPAS TTNAVAAHPE NAGLQPHVAA YKEKVASTYG VNEFSTYRAG 350 

DPGDHGKGLA VDFIVGKNQA LGNEVAQYST QNMAANNISY VIWQQKFYSN 400 

TNSIYGPANT WNAMPDRGGV TANHYDHVHV SFNK (SEQ ID NO:39) 434 

FIG. 7c 

MPHLSKEAFK KQIKNGIIVS CQALPGEPLY TESGGVMPLL ALAAQEAGAV 50 

GIRANSVRDI KEIQEVTNLP IIGIIKREYP PQEPFITATM TEVDQLASLD 100 

IAVIALDCTL RERHDGLSVA EFIQKIKGKY PEQLLMADIS TFEEGKNAFE 150 

AGVDFVGTTL SGYTDYXRQE EGPDIELLNK LCQAGIDVIA EGKIHTPKQA 200 

NEINHIGVAG IVVGGAITRP KEIAERFISG LS (SEQ ID NO: 40) 232 

FIG. 7d 



MSIKKSVIGF CLGAAALSMF ACVDSSQSVM AAEKDKVEI 39 
(SEQ ID NO:41) 

FIG. 7e 
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ATGAAAATGA ATAAAAAGGT ACTATTGACA TCGACAATGG CAGCTTCGCT SO 

ATTATCAGTC GCAAGTGTTC AAGCACAAGA AACAGATACG ACGTGGACAG 100 

CACGTACTGT TTCAGAGGTA AAGGCTGATT TGGTAAAGCA AGACAATAAA 150 

TCATCATATA CTGTGAAATA TGGTGATACA CTAAGCGTTA TTTCAGAAGC 2do 

AATGTCAATT GATATGAATG TCTTAGCAAA AATTAATAAC ATTGCAGATA 350 

TCAATCTTAT TTATCCTGAG ACAACACTGA CAGTAACTTA CGATCAGAAG ton 

AGTCATACTG CCACTTCAAT GAAAATAGAA ACACCAGCAA CAAATGCTGC tin 

TGGTCAAACA ACAGCTACTG TGGATTTGAA AACCAATCAA GTTTCTGTTG 400 

CAGACCAAAA AGTTTCTCTC AATACAATTT CGGAAGGTAT GACACCAGAA 4 50 

GCAGCAACAA CGATTGTTTC GCCAATGAAG ACATATTCTT CTGCGCCAGC 500 

TTTGAAATCA AAAGAAGTAT TAGCACAAGA GCAAGCTGTT AGTCAAGCAG 550 

CAGCTAATGA ACAGGTATCA ACAGCTCCTG TGAAGTCGAT TACTTCAGAA 600 

GTTCCAGCAG CTAAAGAGGA AGTTAAACCA ACTCAGACGT CAGTCAGTCA 650 

GTCAACAACA GTATCACCAG CTTCTGTTGC CGCTGAAACA CCAGCTCCAG 700 

TAGCTAAAGT AGCACCGGTA AGAACTGTAG CAGCCCCTAG AGTGGCAAGT 750 

GTTAAAGTAG TCACTCCTAA AGTAGAAACT GGTGCATCAC CAGAGCATGT 800 

ATCAGCTCCA GCAGTTCCTG TGACTACGAC TTCAACAGCT ACAGACAGTA 850 

AGTTACAAGC GACTGAAGTT AAGAGCGTTC CGGTAGCACA AAAAGCTCCA 900 

ACAGCAACAC CGGTAGCACA ACCAGCTTCA ACAACAAATG CAGTAGCTGC 950 

ACATCCTGAA AATGCAGGGC TCCAACCTCA TGTTGCAGCT TATAAAGAAA 1000 

AAGTAGCGTC AACTTATGGA GTTAATGAAT TCAGTACATA CCGTGCAGGT 1050 

GATCCAGGTG ATCATGGTAA AGGTTTAGCA GTCGACTTTA TTGTAGGTAA 1100 

AAACCAAGCA CTTGGTAATG AAGTTGCACA GTACTCTACA CAAAATATGG 1150 

CAGCAAATAA CATTTCATAT GTTATCTGGC AACAAAAGTT TTACTCAAAT 1200 

ACAAATAGTA TTTATGGACC TGCTAATACT TGGAATGCAA TGCCAGATCG 1250 

TGGTGGCGTT ACTGCCAACC ATTATGACCA TGTTCACGTA TCATTTAACA 1300 
AATAA 



(SEQ ID NO:42) 

FIG. 8 
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CAAGAAACAG 

TGATTTGGTA 

ATACACTAAG 

GCAAAAATTA 

ACTGACAGTA 

TAGAAACACC 

TTGAAAACCA 

AATTTCGGAA 

TGAAGACATA 

CAAGAGCAAG 

TCCTGTGAAG 

AACCAACTCA 

GTTGCCGCTG 

TGTAGCAGCC 

AAACTGGTGC 

ACGACTTCAA 

CGTTCCGGTA 

CTTCAACAAC 

CCTCATGTTG 

TGAATTCAGT 

TAGCAGTCGA 

GCACAGTACT 

CTGGCAACAA 

ATACTTGGAA 

GACCATGTTC 



ATACGACGTG 

AAGCAAGACA 

CGTTATTTCA 

ATAACATTGC 

ACTTACGATC 

AGCAACAAAT 

ATCAAGTTTC 

GGTATGACAC 

TTCTTCTGCG 

CTGTTAGTCA 

TCGATTACTT 

GACGTCAGTC 

AAACACCAGC 

CCTAGAGTGG 

ATCACCAGAG 

CAGCTACAGA 

GCACAAAAAG 

AAATGCAGTA 

CAGCTTATAA 

ACATACCGTG 

CTTTATTGTA 

CTACACAAAA 

AAGTTTTACT 

TGCAATGCCA 

ACGTATCATT 



GACAGCACGT 

ATAAATCATC 

GAAGCAATGT 

AGATATCAAT 

AGAAGAGTCA 

GCTGCTGGTC 

TGTTGCAGAC 

CAGAAGCAGC 

CCAGCTTTGA 

AGCAGCAGCT 

CAGAAGTTCC 

AGTCAGTCAA 

TCCAGTAGCT 

CAAGTGTTAA 

CATGTATCAG 

CAGTAAGTTA 

CTCCAACAGC 

GCTGCACATC 

AGAAAAAGTA 

CAGGTGATCC 

GGTAAAAACC 

TATGGCAGCA 

CAAATACAAA 

GATCGTGGTG 

TAACAAATAA 



ACTGTTTCAG 
ATATACTGTG 
CAATTGATAT 
CTTATTTATC 
TACTGCCACT 
AAACAACAGC 
CAAAAAGTTT 
AACAACGATT 
AATCAAAAGA 
AATGAACAGG 
AGCAGCTAAA 
CAACAGTATC 
AAAGTAGCAC 
AGTAGTCACT 
CTCCAGCAGT 
CAAGCGACTG 
AACACCGGTA 
CTGAAAATGC 
GCGTCAACTT 
AGGTGATCAT 
AAGCACTTGG 
AATAACATTT 
TAGTATTTAT 
GCGTTACTGC 
(SEQ ID 



AGGTAAAGGC 

AAATATGGTG 

GAATGTCTTA 

CTGAGACAAC 

TCAATGAAAA 

TACTGTGGAT 

CTCTCAATAC 

GTTTCGCCAA 

AGTATTAGCA 

TATCAACAGC 

GAGGAAGTTA 

ACCAGCTTCT 

CGGTAAGAAC 

CCTAAAGTAG 

TCCTGTGACT 

AAGTTAAGAG 

GCACAACCAG 

AGGGCTCCAA 

ATGGAGTTAA 

GGTAAAGGTT 

TAATGAAGTT 

CATATGTTAT 

GGACCTGCTA 

CAACCATTAT 

NO:43) 



50 
100 
150 
200 
250 
300 
350 
400 
450 
500 
550 
600 
650 
700 
750 
800 
850 
900 
950 
1000 
1050 
1100 
1150 
1200 
1230 
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QETDTTWTAR 
AKINNIADIN 
LKTNQVSVAD 
QEQAVSQAAA 
VAAETPAPVA 
TTSTATDSKL 
PHVAAYKEKV 
AQYSTQNMAA 
DHVHVSFNK 



TVSEVKADLV 
LIYPETTLTV 
QKVSLNTISE 
NEQVSTAPVK 
KVAPVRTVAA 
QATEVKSVPV 
ASTYGVNEFS 
NNISYVIWQQ 
(SEQ ID NO: 



KQDNKSSYTV 
TYDQKSHTAT 
GMTPEAATTI 
SITSEVPAAK 
PRVASVKWT 
AQKAPTATPV 
TYRAGDPGDH 
KFYSNTNSIY 
44) 



KYGDTLSVIS 
SMKIETPATN 
VSPMKTYSSA 
EEVKPTQTSV 
PKVETGASPE 
AQPASTTNAV 
GKGLAVDFIV 
GPANTWNAMP 



EAMSIDMNVL 
AAGQTTATVD 
PALKSKEVLA 
SQSTTVSPAS 
HVSAPAVPVT 
AAHPENAGLQ 
GKNQALGNEV 
DRGGVTANHY 



50 
100 
150 
200 
250 
300 
350 
400 
409 
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Experience MB-14 
NCS 954 (type Ul) 



posMst post-2nd post-3rd 
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Experience MB-20 
NCS 535 (type V) 



posMst post-2nd post-3rd 
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Experience MB-16 
ATCC12401 (type ib) 
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SEQUENCE LISTING 

<110> BioChem Vaccins 
RIOUX, Clement 
DENIS , Martin 
BRODEUR , Bernard R . 
HAMEL, Josee 
CHARLEBOIS , Isabel le 
BOYER, Martine 



<120> NOVEL GROUP B STREPTOCOCCUS ANTIGENS 



<130> 12806-9PCT 

<150> 60/075,425 
<151> 1998-02-20 

<160> 44 

<170> FastSEQ for Windows Version 3.0 

<210> 1 
<211> 4514 
<212> DNA 

<213> Streptococcus 

<220> 

<221> CDS 

<222> (3)... (464) 

<221> CDS 

<222> (534) . . . (887) 
<223> 



<221> 


CDS 




<222> 


(1024) . . 


. (1767) 


<221> 


CDS 




<222> 


(1841) . . 


. (4288) 


<221> 


CDS 




<222> 


(2735) . . 


. (4288) 


<400> 


1 





ta tct ggc aaa gag cca get aat cgt ttt agt tgg get aaa aat aaa 47 
Ser Gly Lys Glu Pro Ala Asn Arg Phe Ser Trp Ala Lys Asn Lys 
1 5 io 15 
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tta tta ate aat gga ttc att gca act eta gca gca act ate tta ttt 95 
Leu Leu lie Asn Gly Phe He Ala Thr Leu Ala Ala Thr He Leu Phe 
20 25 30 

ttt gca gtt caa ttc ata ggt ctt aaa cca gat tac cct gga aaa acc 143 
Phe Ala Val Gin Phe He Gly Leu Lys Pro Asp Tyr Pro Gly Lys Thr 
35 40 45 

tac ttt att ate eta ttg aca gca tgg act ttg atg gca tta gta act 191 
Tyr Phe He He Leu Leu Thr Ala Trp Thr Leu Met Ala Leu Val Thr 
50 55 60 

get tta gtg gga tgg gat aat agg tat ggt tec ttc ttg teg tta tta 239 
Ala Leu Val Gly Trp Asp Asn Arg Tyr Gly Ser Phe Leu Ser Leu Leu 
65 70 75 

ata tta tta ttc cag ctt ggt tea age gca gga act tac cca ata gaa 287 
He Leu Leu Phe Gin Leu Gly Ser Ser Ala Gly Thr Tyr Pro He Glu 
80 85 90 " 95 

ttg agt cct aag ttc ttt caa aca att caa cca ttt tta ccg atg act 335 
Leu Ser Pro Lys Phe Phe Gin Thr He Gin Pro Phe Leu Pro Met Thr 
100 105 110 

tac tct gtt tea gga tta aga gag acc ate teg ttg acg gga gac gtt 383 
Tyr Ser Val Ser Gly Leu Arg Glu Thr He Ser Leu Thr Gly Asp Val 
115 120 125 

aac cat caa tgg aga atg eta gta ate ttt tta gta tea teg atg ata 431 
Asn His Gin Trp Arg Met Leu Val He Phe Leu Val Ser Ser Met He 
130 135 140 

ctt get ctt ctt att tat cgt aaa caa gaa gat taatagaaag tatctagtga 484 
Leu Ala Leu Leu He Tyr Arg Lys Gin Glu Asp 
145 150 

tagactaaca gtatgatatg gtatgtcaaa gtatttagga ggagaagat atg tct act 542 

Met Ser Thr 
155 

tta aca ata att att gca aca tta act get ttg gaa cat ttt tat att 590 
Leu Thr He He He Ala Thr Leu Thr Ala Leu Glu His Phe Tyr He 
160 165 170 

atg tat ttg gag acg tta gee acc cag tea aat atg act ggg aag att 638 
Met Tyr Leu Glu Thr Leu Ala Thr Gin Ser Asn Met Thr Gly Lys lie 
175 180 185 

ttt agt atg tct aaa gaa gag ttg tea tat tta ccc gtt att aaa ctt 686 
Phe Ser Met Ser Lys Glu Glu Leu Ser Tyr Leu Pro Val He Lys Leu 
190 195 200 205 

ttt aag aat caa ggt gta tac aac ggc ttg att ggc eta ttc etc ctt 734 
Phe Lys Asn Gin Gly Val Tyr Asn Gly Leu He Gly Leu Phe Leu Leu 
210 215 220 
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tat ggg tta tat att tea cag aat caa gaa att gta get gtt ttt tta 782 
Tyr Gly Leu Tyr lie Ser Gin Asn Gin Glu lie Val Ala Val Phe Leu 
225 230 235 

ate aat gta ttg eta gtt get att tat ggt get ttg aca gtt gat aaa 830 
lie Asn Val Leu Leu Val Ala He Tyr Gly Ala Leu Thr Val Asp Lys 
240 245 250 

aaa ate tta tta aaa cag ggt ggt tta cct ata tta get ctt tta aca 878 
Lys He Leu Leu Lys Gin Gly Gly Leu Pro He Leu Ala Leu Leu Thr 
255 260 265 

ttc tta ttt taatactact tagcegtteg atttagttga aeggctttta 927 
Phe Leu Phe 

270 

gtaatcattt ttttctcata atacaggtag tttaagtaat ttgtctttaa aaatagtata 987 
atataactac gaattcaaag agaggtgact ttgatt atg act gag aac tgg tta 1041 

Met Thr Glu Asn Trp Leu 
275 

cat act aaa gat ggt tea gat att tat tat cgt gtc gtt ggt caa ggt 1089 
His Thr Lys Asp Gly Ser Asp He Tyr Tyr Arg Val Val Gly Gin Gly 
280 285 290 

caa ccg att gtt ttt tta cat ggc aat age tta agt agt cgc tat ttt 1137 
Gin Pro He Val Phe Leu His Gly Asn Ser Leu Ser Ser Arg Tyr Phe 
295 300 305 310 

gat aag caa ata gca tat ttt tct aag tat tac caa gtt att gtt atg 1185 
Asp Lys Gin He Ala Tyr Phe Ser Lys Tyr Tyr Gin Val He Val Met 
315 320 325 

gat agt aga ggg cat ggc aaa agt cat gca aag eta aat ace att agt 1233 
Asp Ser Arg Gly His Gly Lys Ser His Ala Lys Leu Asn Thr He Ser 
330 335 340 

ttc agg caa ata gca gtt gac tta aag gat ate tta gtt cat tta gag 1281 
Phe Arg Gin He Ala Val Asp Leu Lys Asp He Leu Val His Leu Glu 
345 350 355 

att gat aaa gtt ata ttg gta ggc cat age gat ggt gee aat tta get 1329 
He Asp Lys Val He Leu Val Gly His Ser Asp Gly Ala Asn Leu Ala 
360 365 370 

tta gtt ttt caa acg atg ttt cca ggt atg gtt aga ggg ctt ttg ctt 1377 
Leu Val Phe Gin Thr Met Phe Pro Gly Met Val Arg Gly Leu Leu Leu 
375 380 385 ~ * 390 

aat tea ggg aac ctg act att cat ggt cag cga tgg tgg gat att ctt 1425 
Asn Ser Gly Asn Leu Thr He His Gly Gin Arg Trp Trp Asp He Leu 
395 400 405 

tta gta agg att gee tat aaa ttc ctt cac tat tta ggg aaa etc ttt 1473 
Leu val Arg He Ala Tyr Lys Phe Leu His Tyr Leu Gly Lys Leu Phe 
410 415 420 
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ccg tat atg agg caa aaa get caa gtt att teg ctt atg ttg gag gat 1521 
Pro Tyr Met Arg Gin Lys Ala Gin Val lie Ser Leu Met Leu Glu Asp 
425 430 435 

ttg aag att agt cca get gat tta cag cat gtg tea act cct gta atg 1569 
Leu Lys lie Ser Pro Ala Asp Leu Gin His Val Ser Thr Pro Val Met 
440 445 450 

gtt ttg gtt gga aat aag gac ata att aag tta aat cat tct aag aaa 1617 
Val Leu Val Gly Asn Lys Asp lie He Lys Leu Asn His Ser Lys Lys 
455 460 465 470 

ctt get tct tat ttt cca agg ggg gag ttt tat tct tta gtt ggc ttt 1665 
Leu Ala Ser Tyr Phe Pro Arg Gly Glu Phe Tyr Ser Leu Val Gly Phe 
475 480 485 

ggg cat cac att att aag caa gat tec cat gtt ttt aat att att gca 1713 
Gly His His He He Lys Gin Asp Ser His Val Phe Asn He He Ala 
490 495 500 

aaa aag ttt ate aac gat acg ttg aaa gga gaa att gtt gaa aaa get 1761 
Lys Lys Phe He Asn Asp Thr Leu Lys Gly Glu He Val Glu Lys Ala 
505 510 515 

aat tga aaaagtcaaa tcactgactt ctgtgattaa aattgtattt tttatatctg 1817 
Asn * 



ttttagtgct tattattgtt gaa atg att cat ttg aaa cga act att tct gtt 1870 

Met He His Leu Lys Arg Thr He Ser Val 
520 525 

gag caa eta aag agt gtt ttt ggg caa tta tct cca atg aat ctt ttc 1918 
Glu Gin Leu Lys Ser Val Phe Gly Gin Leu Ser Pro Met Asn Leu Phe 
530 535 540 545 

tta att ate ctt gtg ggg gtt ate get gtc tta ccg aca acc gga tat 1966 
Leu He He Leu Val Gly Val He Ala Val Leu Pro Thr Thr Gly Tyr 
550 555 560 

gac ttt gta ctg aat gga ctt tta cgt aca gat aaa age aaa agg tat 2014 
Asp Phe Val Leu Asn Gly Leu Leu Arg Thr Asp Lys Ser Lys Arg Tyr 
565 570 575 

att tta cag act agt tgg tgt ate aac act ttt aat aac ttg tea gga 2062 
He Leu Gin Thr Ser Trp Cys He Asn Thr Phe Asn Asn Leu Ser Gly 
580 585 590 

ttc ggt ggc tta ate gat att ggg ttg cgc atg get ttt tat ggt aaa 2110 
Phe Gly Gly Leu He Asp lie Gly Leu Arg Met Ala Phe Tyr Gly Lys 
595 600 ~ 605 

aaa ggt caa gag aag agt gac eta aga gaa gtg act cgt ttt tta ccc 2158 
Lys Gly Gin Glu Lys Ser Asp Leu Arg Glu Val Thr Arg Phe Leu Pro 
610 615 620 625 
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tat ctt att tct ggt ctg tea ttt att agt gtg att gec tta ate atg 2206 
Tyr Leu lie Ser Gly Leu Ser Phe He Ser Val He Ala Leu He Met 
630 635 640 

age cat att ttt cat gec aaa get agt gtt gat tac tat tat ttg gta 2254 
Ser His He Phe His Ala Lys Ala Ser Val Asp Tyr Tyr Tyr Leu Val 
645 650 * 655 

tta att ggt get agt atg tat ttt cct gtt att tat tgg att tct ggt 2302 
Leu lie Gly Ala Ser Met Tyr Phe Pro Val He Tyr Trp He Ser Gly 
660 665 670 

cat aaa gga age cat tat ttc gga gat atg cca tct agt act cgt ata 2350 
His Lys Gly Ser His Tyr Phe Gly Asp Met Pro Ser Ser Thr Arg He 
675 680 685 

aaa tta ggt gtt gtt tct ttt ttt gaa tgg gga tgt gcg gec gca gca 2398 
Lys Leu Gly Val Val Ser Phe Phe Glu Trp Gly Cys Ala Ala Ala Ala 
690 695 700 705 

ttt ata att ate ggt tat tta atg ggc att cat eta cca gtt tat aaa 2446 
Phe He He He Gly Tyr Leu Met Gly He His Leu Pro Val Tyr Lys 
710 715 720 

att tta cca eta ttt tgt att ggt tgt gec gtc ggg att gta tec ctt 2494 
He Leu Pro Leu Phe Cys He Gly Cys Ala Val Gly He Val Ser Leu 
725 730 735 

att ccc ggt gga tta gga agt ttt gaa tta gtt eta ttt aca ggg ttt , 2542 
He Pro Gly Gly Leu Gly Ser Phe Glu Leu Val Leu Phe Thr Gly Phe 
740 745 750 

get gee gag gga eta cct aaa gaa act gtg gtt gca tgg tta tta ctt 2590 
Ala Ala Glu Gly Leu Pro Lys Glu Thr Val Val Ala Trp Leu Leu Leu 
755 760 765 

tat cgt tta gee tac tat att att cca ttc ttt gca ggt ate tat ttc 2638 
Tyr Arg Leu Ala Tyr Tyr He lie Pro Phe Phe Ala Gly lie Tyr Phe 
770 775 780 785 

ttt ate cat tat tta ggt agt caa ata aat caa cgt tat gaa aat gtc 2686 
Phe lie His Tyr Leu Gly Ser Gin He Asn Gin Arg Tyr Glu Asn Val 
790 795 800 

ccg aaa gag tta gta tea act gtt eta caa ace atg gtg age cat ttg 2734 
Pro Lys Glu Leu Val Ser Thr Val Leu Gin Thr Met Val Ser His Leu 
805 810 815 

atg cgt att tta ggt gca ttc tta ata ttt tea aca gca ttt ttt gaa 2782 
Met Arg He Leu Gly Ala Phe Leu lie Phe Ser Thr Ala Phe Phe Glu 
820 825 830 

aat att act tat att atg tgg ttg cag aag eta ggc ttg gac cca tta 2830 
Asn lie Thr Tyr He Met Trp Leu Gin Lys Leu Gly Leu Asp Pro Leu 
835 840 * 845 



WO 99/42588 



PCT/CA99/00U4 



6/63 



caa gaa caa atg eta tgg cag ttt cca ggt tta ttg ctg ggg gtt tgt 
Gin Glu Gin Met Leu Trp Gin Phe Pro Gly Leu Leu Leu Gly Val Cys 
850 855 860 865 



2878 



ttt att etc tta get aga act att gat caa aaa gtg aaa aat get ttt 
Phe lie Leu Leu Ala Arg Thr He Asp Gin Lys Val Lys Asn Ala Phe 
870 875 880 



2926 



cca att get att ate tgg att act ttg aca ttg ttt tat ctt aat tta 
Pro He Ala He He Trp He Thr Leu Thr Leu Phe Tyr Leu Asn Leu 
885 890 895 



2974 



ggt cat att agt tgg cga eta tct ttc tgg ttt att tta eta ttg tta 
Gly His He Ser Trp Arg Leu Ser Phe Trp Phe He Leu Leu Leu Leu 
900 905 910 



3022 



ggc tta tta gtc att aag cca act etc tat aaa aaa caa ttt att tat 
Gly Leu Leu Val He Lys Pro Thr Leu Tyr Lys Lys Gin Phe He Tyr 
915 920 925 



3070 



age tgg gaa gag cgt att aag gat gga ate att ate gtt agt tta atg 
Ser Trp Glu Glu Arg He Lys Asp Gly He He He Val Ser Leu Met 
930 935 940 945 



3118 



gga gtt eta ttt tat att gca gga eta eta ttc cct ate agg get cat 
Gly Val Leu Phe Tyr He Ala Gly Leu Leu Phe Pro He Arg Ala His 
950 955 960 



3166 



att aca ggt ggt agt att gaa cgc ctg cat tat ate ata gca tgg gag 
He Thr Gly Gly Ser He Glu Arg Leu His Tyr He He Ala Trp Glu 
965 970 975 



3214 



ccg ata gca ttg get acg ttg att ctt act etc gtt tat tta tgt ttg 3262 
Pro He Ala Leu Ala Thr Leu He Leu Thr Leu Val Tyr Leu Cys Leu 
980 985 990 

gtt aag att tta caa gga aaa tct tgt cag att ggt gat gtg ttc aat 3310 
Val Lys He Leu Gin Gly Lys Ser Cys Gin He Gly Asp Val Phe Asn 
995 1000 1005 



gtg gat cgt tat aaa aaa eta ctt caa get tac ggt ggt tct teg gat 
Val Asp Arg Tyr Lys Lys Leu Leu Gin Ala Tyr Gly Gly Ser Ser Asp 
1010 1015 1020 1025 



3358 



age ggt tta gec ttt tta aat gat aaa agg etc tac tgg tac caa aaa 
Ser Gly Leu Ala Phe Leu Asn Asp Lys Arg Leu Tyr Trp Tyr Gin Lys 
1030 1035 1040 



3406 



aat gga gaa gat tgc gtt gcg ttc caa ttt gta att gtc aat aat aaa 
Asn Gly Glu Asp Cys Val Ala Phe Gin Phe Val He Val Asn Asn Lys 
1045 1050 1055 



3454 



tgt ctt att atg ggg gaa cca gee ggt gat gac act tat att cgt gaa 
Cys Leu He Met Gly Glu Pro Ala Gly Asp Asp Thr Tyr He Arg Glu 
1060 1065 1070 



3502 
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get att gaa teg ttt att gat gat get gat aag eta gac tat gac ctt 3550 
Ala He Glu Ser Phe He Asp Asp Ala Asp Lys Leu Asp Tyr Asp Leu 
1075 1080 1085 

gtt ttt tac agt att gga cag aag ttg aca eta ctt tta cat gag tat 3598 
Val Phe Tyr Ser He Gly Gin Lys Leu Thr Leu Leu Leu His Glu Tyr 
1090 1095 iioo 1105 

ggt ttt gac ttt atg aaa gtt ggt gag gat get tta gtt aat tta gaa 3646 
Gly Phe Asp Phe Met Lys Val Gly Glu Asp Ala Leu Val Asn Leu Glu 
1110 ins 1120 

acg ttt act ctt aaa ggg aat aag tac aaa cct ttc aga aat gec eta 3694 
Thr Phe Thr Leu Lys Gly Asn Lys Tyr Lys Pro Phe Arg Asn Ala Leu 
1125 H30 H35 

aat aga gtt gaa aag gat ggt ttc tat ttc gaa gtt gta caa teg cca 3742 
Asn Arg Val Glu Lys Asp Gly Phe Tyr Phe Glu Val Val Gin Ser Pro 
1140 1145 liso 

cat agt caa gag eta eta aat agt ttg gaa gag att tct aat act tgg 3790 
His Ser Gin Glu Leu Leu Asn Ser Leu Glu Glu He Ser Asn Thr Trp 
H55 1160 H65 

tta gaa gga cgt cct gaa aaa ggt ttc tea eta gga tat ttt aat aaa 3838 
Leu Glu Gly Arg Pro Glu Lys Gly Phe Ser Leu Gly Tyr Phe Asn Lys 
H70 1175 H80 ' ' H85 

gat tat ttc caa caa gee cca ata get ttg gta aaa aat get gaa cac 3886 
Asp Tyr Phe Gin Gin Ala Pro lie Ala Leu Val Lys Asn Ala Glu His 
H90 1195 ' 1200 

gaa gtt gtt get ttt get aat att atg cca aac tat gaa aag agt att 3934 
Glu Val Val Ala Phe Ala Asn He Met Pro Asn Tyr Glu Lys Ser He 
1205 1210 1215 

ate tct att gat tta atg cgt cac gat aaa cag aaa att ccg aat ggc 3982 
He Ser He Asp Leu Met Arg His Asp Lys Gin Lys He Pro Asn Gly 
1220 1225 1230 

gtt atg gat ttc etc ttt tta tea tta ttc tct tat tat caa gag aag 4030 
Val Met Asp Phe Leu Phe Leu Ser Leu Phe Ser Tyr Tyr Gin Glu Lys 
1235 1240 1245 

gga tac cac tat ttt gat ttg ggg atg gca cct tta tea gga gtt ggt 4078 
Gly Tyr His Tyr Phe Asp Leu Gly Met Ala Pro Leu Ser Gly Val Gly 
1250 1255 1260 1265 

cgc gtt gaa aca agt ttt get aaa gag aga atg gcg tat ctt gtc tat 4126 
Arg Val Glu Thr Ser Phe Ala Lys Glu Arg Met Ala Tyr Leu Val Tyr 
1270 1275 1280 

cat ttc ggt agt cat ttc tac tea ttt aat ggt tta cac aag tat aag 4174 
His Phe Gly Ser His Phe Tyr Ser Phe Asn Gly Leu His Lys Tyr Lys 
1285 1290 1295 
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aag aag ttt aca cca ttg tgg teg gaa cgt tat att tct tgt tct cgt 4222 
Lys Lys Phe Thr Pro Leu Trp Ser Glu Arg Tyr lie Ser Cys Ser Arg 
1300 1305 1310 

teg tec tgg tta att tgt get att tgt gec eta tta atg gaa gat agt 4270 
Ser Ser Trp Leu He Cys Ala He Cys Ala Leu Leu Met Glu Asp Ser 
1315 1320 1325 

aaa att aag att gtt aaa taagctttat ttggcaatta aaaagagcat 4318 
Lys He Lys He Val Lys 
1330 1335 

gtcatgegae atgctctttt taaatcattt aataccattg attgettgaa tctactttat 4378 

aatatgatgt gcttttaaat attgtttagc tactgtagct gctgatttat gctttacagc 4438 

tacttggtag ttcatttctt gcatttcttt ttcagtgata tgaccagcaa gtttattgag 4498 

agcttttttt acttga 4514 
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Met Arg lie Leu 
1 

Asn lie Thr Tyr 
20 

Gin Glu Gin Met 
35 

Phe lie Leu Leu 
50 

Pro He Ala He 
65 

Gly His He Ser 

Gly Leu Leu Val 
100 

Ser Trp Glu Glu 
115 

Gly Val Leu Phe 
130 

He Thr Gly Gly 
145 

Pro He Ala Leu 

Val Lys lie Leu 
180 

Val Asp Arg Tyr 
195 

Ser Gly Leu Ala 
210 

Asn Gly Glu Asp 
225 

Cys Leu He Met 

Ala He Glu Ser 
260 

Val Phe Tyr Ser 
275 

Gly Phe Asp Phe 
290 

Thr Phe Thr Leu 
305 

Asn Arg Val Glu 
His Ser Gin Glu 

340 

Leu Glu Gly Arg 
355 

Asp Tyr Phe Gin 
370 

Glu Val Val Ala 
385 

He Ser lie Asp 

Val Met Asp Phe 
420 

Gly Tyr His Tyr 
435 



Gly 


Ala 


pne 


Leu 


5 








lie 


Met 


Trp 


Leu 


Leu 


Trp 


Gin 


Phe 








ft U 


Aid 


Arg 


inr 


lie 










lie 


Trp 


Tl- 
ilS 


Thr 




"7 ft 






Trp 


Arg 


Leu 


Ser 


85 








He 


Lys 


Pro 


Thr 


Arg 


lie 


Lys 


Asp 








ion 


Tyr 


Ti- 
ne 


Ala 


Gly 






tic; 

1j j 




Ser 


Tin 

lie 


G1U 


Arg 




lbU 






ax a 


inr 


Leu 


xi- 
iie 


165 








Gin 


Gly 


Lys 


Ser 


Lys 


Lys 


Leu 


Leu 








0 ft ft 


Phe 


Leu 


Asn 


Asp 






9 1 ^ 




Cys 


vai 


Aia 


pne 










biy 




Pro 


Ala 


245 








Phe 


lie 


Asp 


Asp 


Tin 
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Glu 
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Gly 








360 


Gin 


Ala 


Pro 
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Arg 


His 


405 
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440 
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lbb 




Leu 


Thr 


Leu 


Val 




1 / u 






Cys 


Gin 


He 


Gly 
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He 
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Asp 
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Gin 


Lys 




410 






Leu 


Phe 


Ser 


Tyr 


425 








Met 


Ala 


Pro 


Leu 



Ala Phe Phe Glu 
15 

Leu Asp Pro Leu 
30 

Leu Gly Val Cys 
45 

Lys Asn Ala Phe 

Tyr Leu Asn Leu 
80 

Leu Leu Leu Leu 
95 

Gin Phe lie Tyr 
110 

Val Ser Leu Met 
125 

lie Arg Ala His 

lie Ala Trp Glu 
160 

Tyr Leu Cys Leu 
175 

Asp Val Phe Asn 
190 

Gly Ser Ser Asp 
205 

Trp Tyr Gin Lys 

Val Asn Asn Lys 
240 

Tyr lie Arg Glu 
255 

Asp Tyr Asp Leu 
270 

Leu His Glu Tyr 
285 

Val Asn Leu Glu 

Arg Asn Ala Leu 
320 

Val Gin Ser Pro 
335 

Ser Asn Thr Trp 
350 

Tyr Phe Asn Lys 
365 

Asn Ala Glu His 

Glu Lys Ser lie 
400 

He Pro Asn Gly 
415 

Tyr Gin Glu Lys 
430 

Ser Gly Val Gly 
445 
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Arg 


Val 


Glu 


Thr 


Ser 


Phe 


Ala 


Lys Glu Arg Met Ala Tyr Leu Val Tyr 




450 










455 






460 


His 


Phe 


Gly 


Ser 


His 


Phe 


Tyr 


Ser 


Phe 


Asn Gly Leu His Lys Tyr Lys 


465 










470 
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Lys 
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Phe 


Thr 
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Trp 


Ser 
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Arg Tyr He Ser Cys Ser Arg 
490 495 


Ser 
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He Cys Ala 


He 
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Ala Leu Leu Met Glu Asp Ser 
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He 
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He 


Val 
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515 



<210> 7 
<211> 5126 
<212> DNA 

<213> Streptococcus 

<220> 

<221> CDS 

<222> {!)... (687) 

<221> CDS 

<222> (701) . . . (2557) 
<221> CDS 

<222> (2566) . . . (3036) 
<221> CDS 

<222> (3106) . . . (4842) 
<221> CDS 

<222> (4850) . . . (5125) 
<400> 7 

aat ttt gat ate gaa aca aca act ttt gag gca atg aaa aag cac gcg 48 
Asn Phe Asp He Glu Thr Thr Thr Phe Glu Ala Met Lys Lys His Ala 
15 10 15 

tea tta ttg gag aaa ata tct gtt gag cgt tct ttt att gaa ttt gat 96 
Ser Leu Leu Glu Lys He Ser Val Glu Arg Ser Phe He Glu Phe Asp 
20 25 30 

aaa ctt eta tta gca cct tat tgg cgt aaa gga atg ctg gca eta ata 144 
Lys Leu Leu Leu Ala Pro Tyr Trp Arg Lys Gly Met Leu Ala Leu He 
35 40 45 

gat agt cat get ttt aat tat eta cca tgc tta aaa aat agg gaa tta 192 
Asp Ser His Ala Phe Asn Tyr Leu Pro Cys Leu Lys Asn Arg Glu Leu 
50 55 " 60 

caa tta age gee ttt ttg tec cag tta gat aaa gat ttt tta ttt gag 240 
Gin Leu Ser Ala Phe Leu Ser Gin Leu Asp Lys Asp Phe Leu Phe Glu 
65 70 75 80 

aca tea gaa caa get tgg gca tea etc ate ttg agt atg gaa gtt gaa 2 88 
Thr Ser Glu Gin Ala Trp Ala Ser Leu He Leu Ser Met Glu Val Glu 
85 90 95 
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cac aca aag act ttt tta aaa aaa tgg aag aca tea act cac ttt caa 336 
His Thr Lys Thr Phe Leu Lys Lys Trp Lys Thr Ser Thr His Phe Gin 
100 105 no 

aaa gat gtt gag cat ata gtg gat gtt tat cgt att cgt gaa caa atg 3B4 
Lys Asp Val Glu His lie Val Asp Val Tyr Arg He Arg Glu Gin Met 
115 120 125 

gga ttg get aaa gaa cat ctt tat cgt tat gga aaa act ata ata aaa 432 
Gly Leu Ala Lys Glu His Leu Tyr Arg Tyr Gly Lys Thr He He Lys 
130 135 140 

caa gcg gaa ggt att cgc aaa gca aga ggc ttg atg gtt gat ttc gaa 480 
Gin Ala Glu Gly He Arg Lys Ala Arg Gly Leu Met Val Asp Phe Glu 
145 150 155 160 

aaa ata gaa caa eta gat agt gag tta gca ate cat gat agg cat gag 528 
Lys He Glu Gin Leu Asp Ser Glu Leu Ala He His Asp Arg His Glu 
165 170 175 

ata gtt gtc aat ggt ggc acc tta ate aag aaa tta gga ata aaa cct 576 
He Val Val Asn Gly Gly Thr Leu He Lys Lys Leu Gly He Lys Pro 
180 185 190 

ggt cca cag atg gga gat att ate tct caa att gaa tta gee att gtt 624 
Gly Pro Gin Met Gly Asp He He Ser Gin He Glu Leu Ala He Val 
195 200 205 



tta gga caa ctg att aat gaa gaa gag get att tta cat ttt gtt aag 
Leu Gly Gin Leu He Asn Glu Glu Glu Ala He Leu His Phe Val Lys 
210 215 220 



672 



cag tac ttg atg gat tagagaggat tat atg age gat ttt tta gta gat 721 
Gin Tyr Leu Met Asp Met Ser Asp Phe Leu Val Asp 

225 230 235 

gga ttg act aag teg gtt ggt gat aag acg gtc ttt agt aat gtt tea 769 
Gly Leu Thr Lys Ser Val Gly Asp Lys Thr Val Phe Ser Asn Val Ser 
240 245 250 

ttt ate ate cat agt tta gac cgt att ggg att att ggt gtc aat gga 817 
Phe lie He His Ser Leu Asp Arg He Gly He He Gly Val Asn Gly 
255 260 265 

act gga aag aca aca eta tta gat gtt att teg ggt gaa tta ggt ttt 865 
Thr Gly Lys Thr Thr Leu Leu Asp Val He Ser Gly Glu Leu Gly Phe 
270 275 280 

gat ggt gat cgt tec cct ttt tea tea get aat gat tat aag att get 913 
Asp Gly Asp Arg Ser Pro Phe Ser Ser Ala Asn Asp Tyr Lys He Ala 
285 290 295 300 

tat tta aaa caa gaa cca gac ttt gat gat tct cag aca att ttg gac 961 
Tyr Leu Lys Gin Glu Pro Asp Phe Asp Asp Ser Gin Thr He Leu Asp 
305 310 315 
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acc gta ctt tct tct gac tta aga gag atg get tta att aaa gaa tat 1009 
Thr Val Leu Ser Ser Asp Leu Arg Glu Met Ala Leu lie Lys Glu Tyr 
320 325 330 

gaa tta ttg ctt aat cac tac gaa gaa agt aag caa tea cgt eta gag 1057 
Glu Leu Leu Leu Asn His Tyr Glu Glu Ser Lys Gin Ser Arg Leu Glu 
335 340 345 

aaa gta atg gca gaa atg gat tct tta gat get tgg tct att gag age 1105 
Lys Val Met Ala Glu Met Asp Ser Leu Asp Ala Trp Ser He Glu Ser 
350 355 360 

gaa gtc aaa aca gta tta tec aaa tta ggt att act gat ttg cag ttg 1153 
Glu Val Lys Thr Val Leu Ser Lys Leu Gly He Thr Asp Leu Gin Leu 
365 370 375 380 

teg gtt ggt gaa tta tea gga gga tta cga aga cgt gtt caa tta gcg 1201 
Ser Val Gly Glu Leu Ser Gly Gly Leu Arg Arg Arg Val Gin Leu Ala 
385 390 395 

caa gta tta tta aat gat gca gat tta ttg etc tta gac gaa cct act 1249 
Gin Val Leu Leu Asn Asp Ala Asp Leu Leu Leu Leu Asp Glu Pro Thr 
400 405 * 410 

aac cac tta gat att gac act att gca tgg tta acg aat ttt ttg aaa 1297 
Asn His Leu Asp He Asp Thr He Ala Trp Leu Thr Asn Phe Leu Lys 
415 420 425 

aat agt aaa aag aca gtg ctt ttt ata act cat gat cgt tat ttt eta 1345 
Asn Ser Lys Lys Thr Val Leu Phe He Thr His Asp Arg Tyr Phe Leu 
430 435 440 

gac aat gtt gca aca cgt att ttt gaa tta gat aag gca cag att aca 1393 
Asp Asn Val Ala Thr Arg He Phe Glu Leu Asp Lys Ala Gin He Thr 
445 450 455 460 

gaa tat caa ggc aat tat cag gat tat gtc cga ctt cgt gca gaa caa 1441 
Glu Tyr Gin Gly Asn Tyr Gin Asp Tyr Val Arg Leu Arg Ala Glu Gin 
465 470 ~ 475 

gac gag cgt gat get get agt tta cat aaa aag aaa cag ctt tat aaa 1489 
Asp Glu Arg Asp Ala Ala Ser Leu His Lys Lys Lys Gin Leu Tyr Lys 
480 485 490 

cag gaa eta get tgg atg cgt act cag cca caa get cgt gca acg aaa 1537 
Gin Glu Leu Ala Trp Met Arg Thr Gin Pro Gin Ala Arg Ala Thr Lys 
495 500 505 

caa cag get cgt att aat cgt ttt caa aat eta aaa aac gat tta cac 1585 
Gin Gin Ala Arg He Asn Arg Phe Gin Asn Leu Lys Asn Asp Leu His 
510 515 520 

caa aca age gat aca age gat ttg gaa atg aca ttt gaa aca agt cga 1633 
Gin Thr Ser Asp Thr Ser Asp Leu Glu Met Thr Phe Glu Thr Ser Arg 
525 530 535 540 
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att ggg aaa aag gtt att aat ttt gaa aat gtc tct ttt tct tac cca 1681 
He Gly Lys Lys Val He Asn Phe Glu Asn Val Ser Phe Ser Tyr Pro 
545 550 555 

gat aaa tct ate ttg aaa gac ttt aat ttg tta att caa aat aaa gac 1729 
Asp Lys Ser He Leu Lys Asp Phe Asn Leu Leu He Gin Asn Lys Asp 
560 565 570 

cgt att ggc ate gtt gga gat aat ggt gtt gga aag tea ace tta ctt 1777 
Arg He Gly He Val Gly Asp Asn Gly Val Gly Lys Ser Thr Leu Leu 
575 580 " 585 

aat tta att gtt caa gat tta cag ccg gat teg ggt aat gtc tct att 1825 
Asn Leu He Val Gin Asp Leu Gin Pro Asp Ser Gly Asn Val Ser He 
590 595 600 

ggt gaa acg ata cgt gta ggt tac ttt tea caa caa ctt cat aat atg 1873 
Gly Glu Thr He Arg Val Gly Tyr Phe Ser Gin Gin Leu His Asn Met 
605 610 615 620 

gat ggc tea aaa cgt gtt att aat tat ttg caa gag gtt gca gat gag 1921 
Asp Gly Ser Lys Arg Val He Asn Tyr Leu Gin Glu Val Ala Asp Glu 
625 630 635 

gtt aaa act agt gtc ggt aca aca agt gtg aca gaa eta ttg gaa caa 1969 
Val Lys Thr Ser Val Gly Thr Thr Ser Val Thr Glu Leu Leu Glu Gin 
640 645 650 

ttt etc ttt cca cgt teg aca cat gga aca caa att gca aaa tta tea 2017 
Phe Leu Phe Pro Arg Ser Thr His Gly Thr Gin He Ala Lys Leu Ser 
655 660 665 

ggt ggt gag aaa aaa aga ctt tac ctt tta aaa ate ctg att gaa aag 2065 
Gly Gly Glu Lys Lys Arg Leu Tyr Leu Leu Lys He Leu He Glu Lys 
670 675 680 

cct aat gtg tta eta ctt gat gag ccg aca aat gac tta gat att get 2113 
Pro Asn Val Leu Leu Leu Asp Glu Pro Thr Asn Asp Leu Asp He Ala 
685 690 695 700 

aca tta act gtt ctt gaa aat ttt tta caa ggc ttt ggt ggt cct gtg 2161 
Thr Leu Thr Val Leu Glu Asn Phe Leu Gin Gly Phe Gly Gly Pro Val 
705 710 715 

att aca gtt agt cac gat cgt tac ttt tta gat aaa gtg get aat aaa 2209 
He Thr Val Ser His Asp Arg Tyr Phe Leu Asp Lys Val Ala Asn Lys 
720 725 730 

att att gcg ttt gaa gat aac gat ate cgt gaa ttt ttt ggt aat tat 2257 
He He Ala Phe Glu Asp Asn Asp He Arg Glu Phe Phe Gly Asn Tyr 
735 740 745 

act gat tat tta gat gaa aaa gca ttt aat gag caa aat aat gaa gtt 2305 
Thr Asp Tyr Leu Asp Glu Lys Ala Phe Asn Glu Gin Asn Asn Glu Val 
750 755 760 
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ate agt aaa aaa gag agt acc aag aca agt cgt gaa aag caa agt cgt 2353 
lie Ser Lys Lys Glu Ser Thr Lys Thr Ser Arg Glu Lys Gin Ser Arg 
765 770 775 780 

aaa aga atg tct tac ttt gaa aaa caa gaa tgg gcg aca att gaa gac 2401 
Lys Arg Met Ser Tyr Phe Glu Lys Gin Glu Trp Ala Thr lie Glu Asp 
785 790 795 

gat att atg ata ttg gaa aat act ate act cgt ata gaa aat gat atg 2449 
Asp lie Met lie Leu Glu Asn Thr lie Thr Arg lie Glu Asn Asp Met 
800 805 810 

caa aca tgt ggt agt gat ttt aca agg tta tct gat tta caa aag gaa 24 97 
Gin Thr Cys Gly Ser Asp Phe Thr Arg Leu Ser Asp Leu Gin Lys Glu 
815 820 825 

tta gat gca aaa aat gaa gca ctt eta gaa aag tat gac cgt tat gag 2545 
Leu Asp Ala Lys Asn Glu Ala Leu Leu Glu Lys Tyr Asp Arg Tyr Glu 
830 835 840 

tac ctt agt gag ttagacac atg att ate cgt ccg att att aaa aat gat 2595 
Tyr Leu Ser Glu LeuAspThrMet He He Arg Pro He He Lys Asn Asp 
845 850 855 860 

gac caa gca gtt gca caa tta att cga caa agt tta cgc gee tat gat 2643 
Asp Gin Ala Val Ala Gin Leu He Arg Gin Ser Leu Arg Ala Tyr Asp 
865 870 875 

tta gat aaa cct gat aca gca tat tea gac cct cac tta gat cat ttg 2691 
Leu Asp Lys Pro Asp Thr Ala Tyr Ser Asp Pro His Leu Asp His Leu 
880 885 890 

acc tea tac tac gaa aaa ata gag aag tea gga ttc ttt gtc att gag 2739 
Thr Ser Tyr Tyr Glu Lys He Glu Lys Ser Gly Phe Phe Val He Glu 
895 900 905 

gag aga gat gag att att ggc tgt ggc ggc ttt ggt ccg ctg aaa aat 2787 
Glu Arg Asp Glu He He Gly Cys Gly Gly Phe Gly Pro Leu Lys Asn 
910 915 920 925 

eta att gca gag atg cag aag gtg tac att gca gaa cgt ttc cgt ggt 2835 
Leu He Ala Glu Met Gin Lys Val Tyr He Ala Glu Arg Phe Arg Gly 
930 935 940 

aag ggg ctt get act gat tta gtg aaa atg att gaa gta gaa get cga 2883 
Lys Gly Leu Ala Thr Asp Leu Val Lys Met He Glu Val Glu Ala Arg 
945 950 955 

aaa att ggg tat aga caa ctt tat tta gag aca gee agt act ttg agt 2931 
Lys He Gly Tyr Arg Gin Leu Tyr Leu Glu Thr Ala Ser Thr Leu Ser 
960 965 970 

agg gca act gcg gtt tat aag cat atg gga tat tgt gee tta teg caa 2979 
Arg Ala Thr Ala Val Tyr Lys His Met Gly Tyr Cys Ala Leu Ser Gin 
975 980 985 
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cca ata gca aat gat caa ggt cat aca get atg gat att tgg atg att 3027 
Pro He Ala Asn Asp Gin Gly His Thr Ala Met Asp He Trp Met He 
990 995 1000 1005 

aaa gat tta taagttgaaa gtggattagt gaacatggat taattatttt 3076 
Lys Asp Leu 



gagataagag gaaagaaaag gagacatat atg gca tat att tgg tct tat ttg 3129 

Met Ala Tyr He Trp Ser Tyr Leu 
1010 1015 

aaa agg tac ccc aat tgg tta tgg ctt gat tta eta gga get atg ctt 3177 
Lys Arg Tyr Pro Asn Trp Leu Trp Leu Asp Leu Leu Gly Ala Met Leu 
1020 1025 1030 

ttt gtg acg gtt ate eta gga atg ccc aca gee tta gcg ggt atg att 3225 
Phe Val Thr Val He Leu Gly Met Pro Thr Ala Leu Ala Gly Met He 
1035 1040 1045 

gat aat ggc gtt aca aaa ggt gat egg act gga gtt tat ctg tgg acg 3273 
Asp Asn Gly Val Thr Lys Gly Asp Arg Thr Gly Val Tyr Leu Trp Thr 
1050 1055 1060 

ttc ate atg ttt ata ttt gtt gta eta ggt att att ggg cgt att acg 3321 
Phe He Met Phe He Phe Val Val Leu Gly He He Gly Arg He Thr 
1065 1070 1075 1080 

atg get tac gca tct agt cgc tta acg aca aca atg att aga gat atg 3369 
Met Ala Tyr Ala Ser Ser Arg Leu Thr Thr Thr Met He Arg Asp Met 
1085 1090 1095 

cgt aat gat atg tat get aag ctt caa gaa tac tec cat cat gaa tat 3417 
Arg Asn Asp Met Tyr Ala Lys Leu Gin Glu Tyr Ser His His Glu Tyr 
1100 1105 ' 1110 

gaa cag ata ggt gta tct tea eta gtg aca cgt atg aca age gat act 3465 
Glu Gin He Gly Val Ser Ser Leu Val Thr Arg Met Thr Ser Asp Thr 
1115 1120 H25 

ttt gtt ttg atg caa ttt get gaa atg tct tta cgt tta ggc eta gta 3513 
Phe Val Leu Met Gin Phe Ala Glu Met Ser Leu Arg Leu Gly Leu Val 
1130 1135 1140 

act cct atg gta atg att ttt age gtg gtt atg ata eta att acg agt 3561 
Thr Pro Met Val Met He Phe Ser Val Val Met He Leu He Thr Ser 
1145 1150 1155 1160 

cca tct ttg get tgg ctt gta gcg gtt gcg atg cct ctt ttg gta gga 3609 
Pro Ser Leu Ala Trp Leu Val Ala Val Ala Met Pro Leu Leu Val Gly 
1165 1170 1175 

gtc gtt tta tat gta get ata aaa aca aaa cct tta tct gaa aga caa 3657 
Val Val Leu Tyr Val Ala He Lys Thr Lys Pro Leu Ser Glu Arg Gin 
1180 1185 1190 
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cag act atg ctt gat aaa ate aat caa tat gtt cgt gaa aat tta aca 3705 
Gin Thr Met Leu Asp Lys lie Asn Gin Tyr Val Arg Glu Asn Leu Thr 
1195 1200 1205 

ggg tta cgc gtt gtt aga gec ttt gca aga gag aat ttt caa tea caa 3753 
Gly Leu Arg Val Val Arg Ala Phe Ala Arg Glu Asn Phe Gin Ser Gin 
1210 1215 1220 

aaa ttt caa gtc get aac caa cgt tac aca gat act tea act ggt ctt 3801 
Lys Phe Gin Val Ala Asn Gin Arg Tyr Thr Asp Thr Ser Thr Gly Leu 
1225 1230 1235 1240 

ttt aaa tta aca ggg eta aca gaa cca ctt ttc gtt caa att att att 3849 
Phe Lys Leu Thr Gly Leu Thr Glu Pro Leu Phe Val Gin lie lie lie 
1245 1250 1255 



gca atg att gtg get ate gtt tgg ttt get ttg gat ccc tta caa aga 
Ala Met He Val Ala He Val Trp Phe Ala Leu Asp Pro Leu Gin Arg 
1260 1265 ~ 1270 



3897 



ggt get att aaa ata ggg gat tta gtt get ttt ate gaa tat age ttc 
Gly Ala He Lys He Gly Asp Leu Val Ala Phe He Glu Tyr Ser Phe 
1275 1280 1285 



3945 



cat get etc ttt tea ttt ttg eta ttt gee aat ctt ttt act atg tat 
His Ala Leu Phe Ser Phe Leu Leu Phe Ala Asn Leu Phe Thr Met Tyr 
1290 1295 1300 



3993 



cct cgt atg gtg gta tea age cat cgt att aga gag gtg atg gat atg 
Pro Arg Met Val Val Ser Ser His Arg He Arg Glu Val Met Asp Met 
1305 1310 1315 



1320 



4041 



cca ate tct ate aat cct aat gee gaa ggt gtt acg gat acg aaa ctt 4089 
Pro He Ser He Asn Pro Asn Ala Glu Gly Val Thr Asp Thr Lys Leu 
1325 1330 1335 

aaa ggg cat tta gaa ttt gat aat gta aca ttc get tat cca gga gaa 4137 
Lys Gly His Leu Glu Phe Asp Asn Val Thr Phe Ala Tyr Pro Gly Glu 
1340 1345 1350 



aca gag agt ccc gtt ttg cat gat att tct ttt aaa get aag cct gga 
Thr Glu Ser Pro Val Leu His Asp He Ser Phe Lys Ala Lys Pro Gly 
1355 1360 1365 



4185 



gaa aca att get ttt att ggt tea aca ggt tea gga aaa tct tct ctt 4233 

Glu Thr He Ala Phe He Gly Ser Thr Gly Ser Gly Lys Ser Ser Leu 
1370 1375 1380 

gtt aat ttg att cca cgt ttt tat gat gtg aca ctt gga aaa ate tta 4281 

Val Asn Leu lie Pro Arg Phe Tyr Asp Val Thr Leu Gly Lys He Leu 
I 385 1390 1395 J ^ 1400 



gta gat gga gtt gat gta aga gat tat aac ctt aaa tea ctt cgc caa 
Val Asp Gly Val Asp Val Arg Asp Tyr Asn Leu Lys Ser Leu Arg Gin 
1405 1410 1415 



4329 
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aag att gga ttt ate ccc caa aaa get ctt tta ttt aca ggg aca ata 4377 
Lys He Gly Phe He Pro Gin Lys Ala Leu Leu Phe Thr Gly Thr He 
1420 1425 1430 

gga gag aat tta aaa tat gga aaa get gat get act att gat gat ctt 4425 
Gly Glu Asn Leu Lys Tyr Gly Lys Ala Asp Ala Thr He Asp Asp Leu 
1435 1440 1445 

aga caa gcg gtt gat att tct caa get aaa gag ttt att gag agt cac 4473 
Arg Gin Ala Val Asp He Ser Gin Ala Lys Glu Phe He Glu Ser His 
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125 130 135 * ! 140 



432 
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att aaa gtc aat ggc aac tat tat gtt tac etc aag cca ggt agt aag 480 
He Lys Val Asn Gly Asn Tyr Tyr Val Tyr Leu Lys Pro Gly Ser Lys 
145 150 ' 155 

cgc aaa aac att cga acc aaa caa caa att get gag caa gta gec aaa 528 
Arg Lys Asn He Arg Thr Lys Gin Gin He Ala Glu Gin Val Ala Lys 
160 165 170 

gga act aaa gaa get aaa gaa aaa ggt tta get caa gtg gec cat etc 576 
Gly Thr Lys Glu Ala Lys Glu Lys Gly Leu Ala Gin Val Ala His Leu 
175 180 185 

agt aaa gaa gaa gtt gcg gca gtc aat gaa gca aaa aga caa gga cgc 624 
Ser Lys Glu Glu Val Ala Ala Val Asn Glu Ala Lys Arg Gin Gly Arg 
190 195 200 

tat act aca gac gat ggc tat att ttt agt ccg aca gat ate att gat 672 
Tyr Thr Thr Asp Asp Gly Tyr He Phe Ser Pro Thr Asp He He Asp 
205 210 215 220 

gat tta gga gat get tat tta gta cct cat ggt aat cac tat cat tat 720 
Asp Leu Gly Asp Ala Tyr Leu Val Pro His Gly Asn His Tyr His Tyr 
225 230 235 

att cct aaa aag gat ttg tct cca agt gag eta get get gca caa gec 768 
He Pro Lys Lys Asp Leu Ser Pro Ser Glu Leu Ala Ala Ala Gin Ala 
240 245 250 

tac tgg agt caa aaa caa ggt cga ggt get aga ccg tct gat tac cgc 816 
Tyr Trp Ser Gin Lys Gin Gly Arg Gly Ala Arg Pro Ser Asp Tyr Arg 
255 260 .265 

ccg aca cca gee cca ggt cgt agg aaa gec cca att cct gat gtg acg 864 
Pro Thr Pro Ala Pro Gly Arg Arg Lys Ala Pro He Pro Asp Val Thr 
270 275 280 

cct aac cct gga caa ggt cat cag cca gat aac ggt ggc tat cat cca 912 
Pro Asn Pro Gly Gin Gly His Gin Pro Asp Asn Gly Gly Tyr His Pro 
285 290 295 ' 300 

gcg cct cct agg cca aat gat gcg tea caa aac aaa cac caa aga gat 960 
Ala Pro Pro Arg Pro Asn Asp Ala Ser Gin Asn Lys His Gin Arg Asp 
305 310 315 



gag ttt aaa gga aaa acc ttt aag gaa ctt tta gat caa eta cac cgt 
Glu Phe Lys Gly Lys Thr Phe Lys Glu Leu Leu Asp Gin Leu His Arg 
320 325 330 



1008 



ctt gat ttg aaa tac cgt cat gtg gaa gaa gat ggg ttg att ttt gaa 1056 

Leu Asp Leu Lys Tyr Arg His Val Glu Glu Asp Gly Leu He Phe Glu 
335 340 345 

ccg act caa gtg ate aaa tea aac get ttt ggg tat gtg gtg cct cat 1104 

Pro Thr Gin Val He Lys Ser Asn Ala Phe Gly Tyr Val Val Pro His 

350 355 360 
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gga gat cat tat cat att ate cca aga agt cag tta tea cct ctt gaa 1152 

Gly Asp His Tyr His lie He Pro Arg Ser Gin Leu Ser Pro Leu Glu 
365 370 375 380 



atg gaa tta gca gat cga tac tta get ggc caa act gag gac aat gac 
Met Glu Leu Ala Asp Arg Tyr Leu Ala Gly Gin Thr Glu Asp Asn Asp 
385 390 ' 395 



1200 



tea ggt tea gag cac tea aaa cca tea gat aaa gaa gtg aca cat acc 1248 
Ser Gly Ser Glu His Ser Lys Pro Ser Asp Lys Glu Val Thr His Thr 
400 405 * 410 

ttt ctt ggt cat cgc ate aaa get tac gga aaa ggc tta gat ggt aaa 1296 
Phe Leu Gly His Arg He Lys Ala Tyr Gly Lys Gly Leu Asp Gly Lys 
415 420 425 

cca tat gat acg agt gat get tat gtt ttt agt aaa gaa tec att cat 1344 
Pro Tyr Asp Thr Ser Asp Ala Tyr Val Phe Ser Lys Glu Ser He His 
430 435 440 

tea gtg gat aaa tea gga gtt aca get aaa cac gga gat cat ttc cac 1392 
Ser Val Asp Lys Ser Gly Val Thr Ala Lys His Gly Asp His Phe His 
445 450 455 460 

tat ata gga ttt gga gaa ctt gaa caa tat gag ttg gat gag gtc get 1440 
Tyr He Gly Phe Gly Glu Leu Glu Gin Tyr Glu Leu Asp Glu Val Ala 
465 470 475 

aac tgg gtg aaa gca aaa ggt caa get gat gag ctt get get get ttg 1488 
Asn Trp Val Lys Ala Lys Gly Gin Ala Asp Glu Leu Ala Ala Ala Leu 
480 485 490 

gat cag gaa caa ggc aaa gaa aaa cca etc ttt gac act aaa aaa gtg 1536 
Asp Gin Glu Gin Gly Lys Glu Lys Pro Leu Phe Asp Thr Lys Lys Val 
495 500 ' 505 

agt cgc aaa gta aca aaa gat ggt aaa gtg ggc tat atg atg cca aaa 1584 
Ser Arg Lys Val Thr Lys Asp Gly Lys Val Gly Tyr Met Met Pro Lys 
510 515 520 

gat ggt aag gac tat ttc tat get cgt gat caa ctt gat ttg act cag 1632 
Asp Gly Lys Asp Tyr Phe Tyr Ala Arg Asp Gin Leu Asp Leu Thr Gin 
525 530 535 540 

att gee ttt gee gaa caa gaa eta atg ctt aaa gat aag aag cat tac 1680 
He Ala Phe Ala Glu Gin Glu Leu Met Leu Lys Asp Lys Lys His Tyr 
545 550 " 555 

cgt tat gac att gtt gac aca ggt att gag cca cga ctt get gta gat 1728 
Arg Tyr Asp He Val Asp Thr Gly He Glu Pro Arg Leu Ala Val Asp 
560 565 570 

gtg tea agt ctg ccg atg cat get ggt aat get act tac gat act gga 1776 
Val Ser Ser Leu Pro Met His Ala Gly Asn Ala Thr Tyr Asp Thr Gly 
575 580 585 
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agt teg ttt gtt ate cca cat att gat cat ate cat gtc gtt ccg tat 1824 
Ser Ser Phe Val He Pro His He Asp His He His Val Val Pro Tyr 
590 595 600 

tea tgg ttg acg cgc gat cag att gca aca gtc aag tat gtg atg caa 1872 
Ser Trp Leu Thr Arg Asp Gin He Ala Thr Val Lys Tyr Val Met Gin 
605 610 615 ' 620 

cac ccc gaa gtt cgt ccg gat gta tgg tct aag cca ggg cat gaa gag 1920 
His Pro Glu Val Arg Pro Asp Val Trp Ser Lys Pro Gly His Glu Glu 
625 630 635 

tea ggt teg gtc att cca aat gtt acg cct ctt gat aaa cgt get ggt 1968 
Ser Gly Ser Val He Pro Asn Val Thr Pro Leu Asp Lys Arg Ala Gly 
640 645 650 

atg cca aac tgg caa att ate cat tct get gaa gaa gtt caa aaa gee 2016 
Met Pro Asn Trp Gin He He His Ser Ala Glu Glu Val Gin Lys Ala 
655 660 665 

eta gca gaa ggt cgt ttt gca aca cca gac ggc tat att ttc gat cca 2064 
Leu Ala Glu Gly Arg Phe Ala Thr Pro Asp Gly Tyr He Phe Asp Pro 
670 675 680 

cga gat gtt ttg gec aaa gaa act ttt gta tgg aaa gat ggc tec ttt 2112 
Arg Asp Val Leu Ala Lys Glu Thr Phe Val Trp Lys Asp Gly Ser Phe 
685 690 695 * ' 700 



age ate cca aga gca gat ggc agt tea ttg aga acc att aat aaa tct 
Ser He Pro Arg Ala Asp Gly Ser Ser Leu Arg Thr He Asn Lys Ser 
705 710 715 



2160 



gat eta tec caa get gag tgg caa caa get caa gag tta ttg gca aag 2208 
Asp Leu Ser Gin Ala Glu Trp Gin Gin Ala Gin Glu Leu Leu Ala Lys 
720 725 730 

aaa aat act ggt gat get act gat acg gat aaa ccc aaa gaa aag caa 2256 
Lys Asn Thr Gly Asp Ala Thr Asp Thr Asp Lys Pro Lys Glu Lys Gin 
735 740 745 

cag gca gat aag age aat gaa aac caa cag cca agt gaa gee agt aaa 2304 
Gin Ala Asp Lys Ser Asn Glu Asn Gin Gin Pro Ser Glu Ala Ser Lys 
750 755 760 

gaa gaa aaa gaa tea gat gac ttt ata gac agt tta cca gac tat ggt 2352 
Glu Glu Lys Glu Ser Asp Asp Phe He Asp Ser Leu Pro Asp Tyr Gly 
765 770 775 780 

eta gat aga gca acc eta gaa gat cat ate aat caa tta gca caa aaa 2400 
Leu Asp Arg Ala Thr Leu Glu Asp His He Asn Gin Leu Ala Gin Lys 
785 790 795 

get aat ate gat cct aag tat etc att ttc caa cca gaa ggt gtc caa 2448 
Ala Asn He Asp Pro Lys Tyr Leu He Phe Gin Pro Glu Gly Val Gin 
800 805 810 
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ttt tat aat aaa aat ggt gaa ttg gta act tat gat ate aag aca ctt 2496 
Phe Tyr Asn Lys Asn Gly Glu Leu Val Thr Tyr Asp He Lys Thr Leu 
815 820 * " 825 

caa caa ata aac cct taaccaaaag aagatctcat tgttaaagca ctgctttgtc 2551 
Gin Gin He Asn Pro 
830 

aaagcaagtt acggtgattt tgaagtcatt ctatgtaacg agtagtgata aaagttggat 2611 

aatagcggtt ttcttttgea aagaaatggt atccatgtta gaatagtaaa aaaagaggag 2671 

gattcttgga ctaatgtcaa ataagtagac agaaaactgt gttattttattgcgt 2726 

taaaataatt ttcttctttc tgattagggg ttagtcctag attagcegta tgtgggttgt 2786 

aattgttata aaaattctca atgtattcaa agcagtctaa ttgaacctgt ttgatatttt 2846 

gataatgttt teggttgatt tgtctatget ttaaatactt gaaaaatget teagttaegg 2906 

cattatcata aggatatcca ggattagaaa aagaatgeat gatattggca ctgcacccta 2966 

atagtgagac gcaagaaaaa cacttttaggcaatcagtt ttctgtactg tacaggegae 3025 

tggtcgttta atctctgttg aattctagtt tcattataaa atgtaatgta atttttaaca 3085 

atatttgtta tactatcttt gttgtatttt ctcctattat ggaaataaaa ggtttcagtc 3145 

tttaggaegg tgtgaaacca ttcaatacag gcattatctg caggtgttcc ttttcgagac 3205 

attgagegga taatgtcttt ttccgtgcaa gcctggtagt aagecataga agtatacact 3265 

gagccttggt cactgtgtaa gattgetect ttatttaggcaatt ttaactgatt 3319 

aagggtgtct agtacaaaat ccgtgtcctg acaatctgag atagtgtaag ctataatttc 3379 

teggttatag agattcataa ttgatgagag atacaattta cagttaccga aatataggta 3439 

ggtaatatct gttacgagct tttccttagg ettateggea tggaaatccc gactcaattt 3499 

attatctgtt aaataataag ctttacccaa attgggaact ttcttggtac gtgtccgaca 3559 

aagccagcca ttatttttca tgatacgata gactttcttt gtattaacag teaatcegtg 3619 

gatttttttg ageaategtg taatggtacg atagecataa ataaagtgat tctccataca 3679 

gagctgttca attaattcaa taaggtcatc tttttttgcg gcttctcata ctcctttttc 3739 

caaeggtaat aggtcgaccg cttgacctta aaacagtcta gaatgaaaac tategggtag 3799 

ttgtttttat agtcttccac aagcttgata agacttactttatcgatt tccttatcaa 3857 

gcctcgatac ttttttaaga ggtcaacctg taattgtaat tgttccactt cagacagatg 3917 

ttccaagcct ttacegtagg tatattgett gccaacacct tgatgaaaac gataaagctc 3977 

ctegtttteg taccatttca tccaagtata gatttgacta ttatttttga tgcctaaagt 4037 

ctccataata actctgttag acttgcctgc tttcttcata tegatgeaag ccagcttagt 4097 

ttcccatgaa tatgettttt taaccataat aaaacattcc tgtttctagt ttactaaatt 4157 

tcaacaggag tgtttttctt ttgtctcatt ttagggattc agtgcctatt gttgtcatca 4217 

attatttttc taaattcccc ggactcaaat tgtgaccctt ggtcggaatg aaagagaagt 4277 

gttccttcaa tctttctttt attaagtgaa aaggcaacac ttttctgtac aacatttata 4337 

aagtgttttt ctaggcaattaatc ttttagtcat tggtgtttgg tagttgagac 4391 

taccatgaat gcggtggtaa ttccaccaat gaacatagtc tttagtctta agagctagtt 4451 

cttccagcaa ttgaaaggtt tcttgataaa caaattcaat tttgaaagca egataegtae 4511 

tttcagctac ggcattgtca taaggataac cagcctgact aagcgaacgt gtgattccaa 4571 

aggcttccaa tatttcatca attaactgat tatcaaactc tttgecaega tctgaatgga 4631 

acatcttgac tttggtcagg gcgtaaggga tgctttgtat ggcttgetta acgagttcag 4691 

cggtcttgtg ccaaccaaga gaeaggcega tgatttcacg gttgtatagg tcaatgatga 4751 

ggcaaacata agcccaacga ttgcctacac gaacataggt taagtcagtg actaaggctt 4811 

gtagtggtct ttcttgetta aattgcctgt ctaagtggtt gggaataggg gcttcattct 4871 

tgectctaga atgtggtttg aaggtggctt tctgataaac agaaaccaaa ttgagtcget 4931 

teataatgeg tcgaatccga cgacgtgaaa gtgtgatacc ttcgttattc aagcatattt 4991 

tgatttttct ggatcegtat ctagactcgc tatcgagaaa aattctttta atagtttctt 5051 

caaactccgt ttcagatact gactccacgg cttgatagta ataacttgag tgtggcatat 5111 

tcagccagcg acacatcttt gaaatgctgt atttatcctt attagcagtg attatttccc 5171 

tttttgtgcc ataatcaccg ctgettgett taggatatct aatt 5215 

<210> 14 

<211> 40 

<212> PRT 
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<213> Streptococcus 



<400> 14 

Phe Gly Ser Ala Leu Ser Thr Val Glu Val Lys Glu lie lie Ser Glu 

1 5 10 15 

Glu Asn He Trp Leu Tyr Arg Leu Ser Cys Cys His Phe Thr Ser Tyr 

20 25 30 

Ser Tyr Trp Lys Leu Pro Thr Trp 
35 40 



<210> 15 

<211> 793 

<212> PRT 

<213> Streptococcus 



<400> 15 



Met Gly 


Leu 


Ala 


Thr 


Lys 


Asp Asn 


Gin He 


Ala Tyr He 


Asp Asp Ser 


1 






5 








10 




15 


Lys Gly 


Lys 


Ala 


Lys 


Ala 


Pro Lys 


Thr Asn 


Lys Thr Met 


Asp Gin He 






20 










25 




30 


Ser Ala 


Glu 


Glu 


Gly 


He 


Ser 


Ala 


Glu Gin 


He Val Val 


Lys lie Thr 




35 










40 




45 


Asp Gin 


Gly Tyr 


Val 


Thr 


Ser 


His 


Gly Asp 


His Tyr His 


Phe Tyr Asn 


50 










55 






60 


Gly Lys 


Val 


Pro 


Tyr 


Asp 


Ala 


He 


He Ser 


Glu Glu Leu 


Leu Met Thr 










70 








75 


80 


Asp Pro 


Asn 


Tyr 


Arg 


Phe 


Lys 


Gin 


Ser Asp 


Val He Asn 


Glu He Leu 








85 








90 




95 


Asp vjiy 


Tyr Val 


Tl- 
116 


Lys 


Val 


Asn 


Gly Asn 


Tyr Tyr Val 


Tyr Leu Lys 






100 










105 




110 


Pro Gly 


Ser 


Lys 


Arg 


Lys 


Asn 


He 


Arg Thr 


Lys Gin Gin 


He Ala Glu 




115 










120 




125 




Gin Val 


Ala 


Lys 


Gly 


Thr 


Lys 


Glu 


Ala Lys 


Glu Lys Gly Leu Ala Gin 


130 










135 






140 




Val Ala 


His 


Leu 


Ser 


Lys 


Glu 


Glu 


Val Ala 


Ala Val Asn Glu Ala Lys 


145 








150 








155 


160 


Arg Gin 


Gly Arg 


Tyr 


Thr 


Thr Asp 


Asp Gly 


Tyr He Phe 


Ser Pro Thr 








165 








170 




175 


Asp He 


He 


Asp 


Asp 


Leu 


Gly Asp 


Ala Tyr 


Leu Val Pro 


His Gly Asn 






180 










185 




190 


His Tyr 


His 


Tyr 


He 


Pro 


Lys 


Lys 


Asp Leu 


Ser Pro Ser 


Glu Leu Ala 




195 










200 




205 




Ala Ala 


Gin 


Ala 


Tyr 


Trp 


Ser 


Gin 


Lys Gin 


Gly Arg Gly Ala Arg Pro 


210 










215 






220 




Ser Asp 


Tyr Arg 


Pro 


Thr 


Pro 


Ala 


Pro Gly 


Arg Arg Lys Ala Pro He 


225 








230 








235 


240 


Pro Asp 


Val 


Thr 


Pro 


Asn 


Pro Gly 


Gin Gly 


His Gin Pro Asp Asn Gly 








245 








250 




255 


Gly Tyr 


His 


Pro 


Ala 


Pro 


Pro Arg 


Pro Asn 


Asp Ala Ser Gin Asn Lys 






260 










265 




270 


His Gin 


Arg Asp 


Glu 


Phe 


Lys Gly 


Lys Thr 


Phe Lys Glu Leu Leu Asp 




275 










280 




285 




Gin Leu 


His 


Arg 


Leu 


Asp 


Leu Lys 


Tyr Arg 


His Val Glu Glu Asp Gly 


290 










295 






300 




Leu He 


Phe 


Glu 


Pro 


Thr 


Gin 


Val 


He Lys 


Ser Asn Ala 


Phe Gly Tyr 


305 








310 








315 


320 
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Val Val 


Pro 


His 


Ser Pro 


Leu 


GlU 






1 ^ f\ 

.34 U 


Glu Asp 


Asn 


Asp 




355 




Val Tnr 


His 


Thr 


j / u 






Leu Asp 


Gly 


Lys 


385 






Glu Ser 


He 


His 


Asp His 


rne 


TT ^ ft 

HIS 






A 9 ft 
ft 4 U 


Asp Glu 


vai 


T\ "1 _ 

Aia 




A 1 K 
4 




TA 1 M 3 

Aia Aia 


Aia 


Leu 


4 50 






inr jjys 


Lys 


v ai 


465 






Met Met 


Pro 


Lys 


TV t^y\ T All 

Asp Lieu 


inr 


/tin 
Lrin 






bUU 


Lys Lys 


His 


Tyr 








Leu Ala 


val 


Asp 








Tyr Asp 


Thr 


Gly 


545 






Val Val 


Pro 


Tyr 


ry r va i 


Met 


Gin 






580 


Gly His 


Glu 


Glu 




bSJb 




bys Arg 


Aia 


biy 


DiU 






Val Gin 


Lys 


Ala 


625 






He Phe 


Asp 


Pro 


Asp Gly 


Ser 


Phe 






ccr\ 

DoU 


He Asn 


Lys 


Ser 




675 




Leu Leu 


Ala 


Lys 


690 






Lys Glu 


Lys 


Gin 


705 






Glu Ala 


Ser 


Lys 


Pro Asp 


Tyr 


Gly 






740 


Leu Ala 


Gin 


Lys 




755 





Gly Asp His Tyr 
325 

Met Glu Leu Ala 

Ser Gly Ser Glu 
360 

Phe Leu Gly His 
375 

Pro Tyr Asp Thr 
390 

Ser Val Asp Lys 
405 

Tyr He Gly Phe 

Asn Trp Val Lys 
440 

Asp Gin Glu Gin 
455 

Ser Arg Lys Val 
470 

Asp Gly Lys Asp 
485 

He Ala Phe Ala 

Arg Tyr Asp He 
520 

Val Ser Ser Leu 
535 

Ser Ser Phe Val 
550 

Ser Trp Leu Thr 
565 

His Pro Glu Val 

Ser Gly Ser Val 
600 

Met Pro Asn Trp 
615 

Leu Ala Glu Gly 
630 

Arg Asp Val Leu 
645 

Ser He Pro Arg 

Asp Leu Ser Gin 
680 

Lys Asn Thr Gly 
695 

Gin Ala Asp Lys 
710 

Glu Glu Lys Glu 
725 

Leu Asp Arg Ala 

Ala Asn He Asp 
760 
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His 


He 


He 


Pro 




330 






Asp 


Arg 


Tyr 


Leu 


345 








His 


Ser 


Lys 


Pro 


Arg 


lie 


Lys 


Ala 








3 80 


Ser 


Asp 


Ala 


Tyr 






395 




Ser 


Gly 


Val 


Thr 




410 






Gly 


Glu 


Leu 


Glu 


425 








Ala 


Lys 


Gly 


Gin 


Gly 


Lys 


Glu 


Lys 








460 


Thr 


Lys 


Asp 


Gly 






A *7 C 

4 /b 




Tyr 


Phe 


Tyr 


Ala 










Glu 


Gin 


Glu 


Leu 


505 








Val 


Asp 


Thr 


Gly 


Pro 


Met 


His 


Ala 








54 0 


He 


Pro 


His 


He 






555 




Arg 


Asp 


Gin 


He 




570 






Arg 


Pro 


Asp 


Val 


585 








He 


Pro 


Asn 


Val 


Gin 


He 


He 


His 








620 


Arg 


Phe 


Ala 


Thr 






bib 




Ala 


Lys 


Glu 


Thr 




CCA 






TV 1 -v 

Ala 


Asp 


Gly 


Ser 


ODD 








Ala 


Glu 


Trp 


Gin 


Asp 


Ala 


Thr 


Asp 








700 


Ser 


Asn 


Glu 


Asn 






715 




Ser 


Asp 


Asp 


Phe 




730 






Thr 


Leu 


Glu 


Asp 


745 








Pro 


Lys 


Tyr 


Leu 



Arg 


Ser 


Gin 


Leu 






335 




Ala 


Gly 


Gin 


Thr 




350 






Ser 


Asp 


Lys 


Glu 


365 








Tyr 


Gly 


Lys 


Gly 


Val 


Phe 


Ser 


Lys 








400 


Ala 


Lys 


His 


Gly 






415 




Gin 


Tyr 


Glu 


Leu 




430 






Ala 


Asp 


Glu 


Leu 


445 








Pro 


Leu 


Phe 


Asp 


Lys 


Val 


Gly 


Tyr 








480 


Arg 


Asp 


Gin 


Leu 






495 




Met 


Leu 


Lys 


Asp 




510 






lie 


Glu 


Pro 


Arg 


525 








Gly 


Asn 


Ala 


Thr 


Asp 


His 


He 


His 








560 


Ala 


Thr 


Val 


Lys" 






575 




Trp 


Ser 


Lys 


Pro 




590 






Thr 


Pro 


Leu 


Asp 


605 








Ser 


Ala 


Glu 


Glu 


Pro 


Asp 


Gly 


Tyr 








640 


Phe 


Val 


Trp 


Lys 






655 




Ser 


Leu 


Arg 


Thr 




670 






Gin 


Ala 


Gin 


Glu 


685 








Thr 


Asp 


Lys 


Pro 


Gin 


Gin 


Pro 


Ser 








720 


He 


Asp 


Ser 


Leu 






735 




His 


He 


Asn 


Gin 




750 






He 


Phe 


Gin 


Pro 



765 
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Glu Gly Val Gin Phe Tyr Asn Lys Asn Gly Glu Leu Val Thr Tyr Asp 

770 775 780 

He Lys Thr Leu Gin Gin He Asn Pro 
785 790 

<210> 16 
<211> 715 
<212> PRT 

<213> Streptococcus 
<400> 16 

Met Thr Asp Pro Asn Tyr Arg Phe Lys Gin Ser Asp Val He Asn Glu 

15 10 15 

He Leu Asp Gly Tyr Val He Lys Val Asn Gly Asn Tyr Tyr Val Tyr 

20 25 30 

Leu Lys Pro Gly Ser Lys Arg Lys Asn He Arg Thr Lys Gin Gin He 

35 40 45 

Ala Glu Gin Val Ala Lys Gly Thr Lys Glu Ala Lys Glu Lys Gly Leu 

50 55 60 

Ala Gin Val Ala His Leu Ser Lys Glu Glu Val Ala Ala Val Asn Glu 
65 70 75 80 

Ala Lys Arg Gin Gly Arg Tyr Thr Thr Asp Asp Gly Tyr He Phe Ser 

85 90 95 

Pro Thr Asp He He Asp Asp Leu Gly Asp Ala Tyr Leu Val Pro His 

100 105 no 

Gly Asn His Tyr His Tyr He Pro Lys Lys Asp Leu Ser Pro Ser Glu 

115 120 125 

Leu Ala Ala Ala Gin Ala Tyr Trp Ser Gin Lys Gin Gly Arg Gly Ala 

130 135 140 

Arg Pro Ser Asp Tyr Arg Pro Thr Pro Ala Pro Gly Arg Arg Lys Ala 
145 150 155 160 

Pro He Pro Asp Val Thr Pro Asn Pro Gly Gin Gly His Gin Pro Asp 

165 170 175 

Asn Gly Gly Tyr His Pro Ala Pro Pro Arg Pro Asn Asp Ala Ser Gin 

180 185 ~ 190 

Asn Lys His Gin Arg Asp Glu Phe Lys Gly Lys Thr Phe Lys Glu Leu 

195 200 ~ 205 

Leu Asp Gin Leu His Arg Leu Asp Leu Lys Tyr Arg His Val Glu Glu 

210 215 220 

Asp Gly Leu He Phe Glu Pro Thr Gin Val He Lys Ser Asn Ala Phe 
225 230 235 240 

Gly Tyr Val Val Pro His Gly Asp His Tyr His He He Pro Arg Ser 

245 250 255 

Gin Leu Ser Pro Leu Glu Met Glu Leu Ala Asp Arg Tyr Leu Ala Gly 

260 265 270 

Gin Thr Glu Asp Asn Asp Ser Gly Ser Glu His Ser Lys Pro Ser Asp 

275 280 285 

Lys Glu Val Thr His Thr Phe Leu Gly His Arg He Lys Ala Tyr Gly 

290 295 300 

Lys Gly Leu Asp Gly Lys Pro Tyr Asp Thr Ser Asp Ala Tyr Val Phe 
305 310 315 320 

Ser Lys Glu Ser He His Ser Val Asp Lys Ser Gly Val Thr Ala Lys 

325 330 335 

His Gly Asp His Phe His Tyr He Gly Phe Gly Glu Leu Glu Gin Tyr 
340 345 ' 350 
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<400> 17 
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15 10 15 
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<210> 22 
<211> 5058 
<212> DNA 

<213> streptococcus 

<220> 

<221> CDS 

<222> (1) . . . (663) 

<221> CDS 

<222> (763) . . . (1344) 
<221> CDS 

<222> (1362) . . . (1739) 
<221> CDS 

<222> (2266) . . . (5058) 
<400> 22 

aat ttg aaa gca gaa tta tct gta gaa gat gag caa tat aca gca aca 4 8 

Asn Leu Lys Ala Glu Leu Ser Val Glu Asp Glu Gin Tyr Thr Ala Thr 
15 10 15 

gtt tat ggt aaa tct get cat ggt tea aca cca caa gaa ggt gtt aat 96 
Val Tyr Gly Lys Ser Ala His Gly Ser Thr Pro Gin Glu Gly Val Asn 
20 25 30 

• ggg gcg act tat tta get ctt tat eta agt caa ttt gat ttt gaa ggt 144 
Gly Ala Thr Tyr Leu Ala Leu Tyr Leu Ser Gin Phe Asp Phe Glu Gly 
35 40 45 

cct get cgt get ttc tta gat gtt aca gee aac att att cac gaa gac 192 
Pro Ala Arg Ala Phe Leu Asp Val Thr Ala Asn He He His Glu Asp 
50 55 60 

ttc tea ggt gaa aaa ctt gga gta get tat gaa gat gac tgt atg gga 240 
Phe Ser Gly Glu Lys Leu Gly Val Ala Tyr Glu Asp Asp Cys Met Gly 
65 70 75 80 

cca ttg age atg aat gca ggt gtc ttc cag ttt gat gaa act aat gat 288 
Pro Leu Ser Met Asn Ala Gly Val Phe Gin Phe Asp Glu Thr Asn Asp 
85 90 95 

gat aat act ate get ctt aat ttc cgt tac cca caa ggg aca gat get 336 
Asp Asn Thr He Ala Leu Asn Phe Arg Tyr Pro Gin Gly Thr Asp Ala 
100 105 HO 

aaa act ate caa act aag ctt gag aaa ctt aac gga gtt gaa aaa gtg 384 
Lys Thr He Gin Thr Lys Leu Glu Lys Leu Asn Gly Val Glu Lys Val 
115 120 125 

act ctt tct gac cat gaa cac aca cca cac tat gta cct atg gac gat 432 
Thr Leu Ser Asp His Glu His Thr Pro His Tyr Val Pro Met Asp Asp 
130 135 " 140 
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gaa tta gta tea acc tta eta get gtc tat gaa aag caa act ggt ctt 480 
Glu Leu Val Ser Thr Leu Leu Ala Val Tyr Glu Lys Gin Thr Gly Leu 
145 150 155 160 

aaa gga cat gaa cag gtt att ggt ggt ggg aca ttt ggt cgc tta ctt 528 
Lys Gly His Glu Gin Val lie Gly Gly Gly Thr Phe Gly Arg Leu Leu 
165 170 175 

gaa egg ggt gtt gca tac ggt gee atg ttc cca gga gat gaa aac act 576 
Glu Arg Gly Val Ala Tyr Gly Ala Met Phe Pro Gly Asp Glu Asn Thr 
180 185 190 

atg cat caa get aat gag tac atg cct tta gaa aat att ttc cgt teg 624 
Met His Gin Ala Asn Glu Tyr Met Pro Leu Glu Asn lie Phe Arg Ser 
195 200 205 

get get ate tac gca gaa get ate tat gaa tta ate aaa taaaataatc 673 
Ala Ala He Tyr Ala Glu Ala He Tyr Glu Leu He Lys 
210 215 220 

cttaaactaa atatgtgatc aatgataaag ggtggtgaag acatgaaagt gtctttgect 733 
cttttcataa ggttagattt ggagacttt atg act gac ttg gaa aaa att att 786 

Met Thr Asp Leu Glu Lys He He 
225 

aaa gca ata aaa agt gat tea cag aat caa aat tat aca gaa aat ggt 834 
Lys Ala He Lys Ser Asp Ser Gin Asn Gin Asn Tyr Thr Glu Asn Gly 
230 235 240 245 

att gat cct ttg ttt get get cct aaa aca get agg ate aat att gtt 882 
He Asp Pro Leu Phe Ala Ala Pro Lys Thr Ala Arg He Asn He Val 
250 255 260 

ggc caa gca cct ggt tta aaa act caa gaa gca aga etc tat tgg aaa 930 
Gly Gin Ala Pro Gly Leu Lys Thr Gin Glu Ala Arg Leu Tyr Trp Lys 
2 65 270 275 

gat aaa tct gga gat cgt eta cgc cag tgg ctt gga gtt gat gaa gag 978 
Asp Lys Ser Gly Asp Arg Leu Arg Gin Trp Leu Gly Val Asp Glu Glu 
280 285 290 

aca ttt tac cat tct gga aaa ttt get gtt tta cct tta gat ttt tat 1026 
Thr Phe Tyr His Ser Gly Lys Phe Ala Val Leu Pro Leu Asp Phe Tyr 
295 300 305 

tac cca ggc aaa gga aaa tea gga gat tta ccc cct aga aaa ggt ttt 1074 
Tyr Pro Gly Lys Gly Lys Ser Gly Asp Leu Pro Pro Arg Lys Gly Phe 
310 315 320 325 

gcg gag aaa tgg cac cct ctt att tta aaa gaa atg cct aat gtt caa 1122 
Ala Glu Lys Trp His Pro Leu He Leu Lys Glu Met Pro Asn Val Gin 
330 335 340 

ttg acc ttg eta gtt ggt cag tat get cag aaa tat tat ctt gga age 1170 
Leu Thr Leu Leu Val Gly Gin Tyr Ala Gin Lys Tyr Tyr Leu Gly Ser 
345 350 355 
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tec gca cat aaa aat eta aca gaa aca gtt aaa get tac aaa gac tat 1218 
Ser Ala His Lys Asn Leu Thr Glu Thr Val Lys Ala Tyr Lys Asp Tyr 
360 365 370 

eta ccc gat tat tta ccc ctg gtt cac cca tea ccg cga aat caa att 1266 
Leu Pro Asp Tyr Leu Pro Leu Val His Pro Ser Pro Arg Asn Gin lie 
375 380 385 

tgg eta aag aag aat cca tgg ttt gaa aaa gat eta ate gtt gat tta 1314 
Trp Leu Lys Lys Asn Pro Trp Phe Glu Lys Asp Leu He Val Asp Leu 
390 395 400 " 405 

caa aag ata gta gca gat att tta aaa gat taaggatagg agttggt atg 1364 
Gin Lys He Val Ala Asp He Leu Lys Asp Met 
410 415 

aga gat aat cat eta cac acg tat ttt tec tat gat tgt caa acg gca 1412 
Arg Asp Asn His Leu His Thr Tyr Phe Ser Tyr Asp Cys Gin Thr Ala 
420 425 430 

ttt gag gac tat att aat ggt ttt aca ggt gaa ttt ate acg aca gaa 1460 
Phe Glu Asp Tyr He Asn Gly Phe Thr Gly Glu Phe He Thr Thr Glu 
435 440 445 

cat ttt gat tta tea aat cct tac ace ggt caa gac gat gtt cct gat 1508 
His Phe Asp Leu Ser Asn Pro Tyr Thr Gly Gin Asp Asp Val Pro Asp 
450 455 460 

tat agt get tat tgt caa aaa ata gat tat ctt aat cag aaa tat gga 1556 
Tyr Ser Ala Tyr Cys Gin Lys He Asp Tyr Leu Asn Gin Lys Tyr Gly 
465 470 475 480 

aat cga ttt aaa aaa gga att gaa ate ggt tat ttt aaa gat agg gaa 1604 
Asn Arg Phe Lys Lys Gly He Glu lie Gly Tyr Phe Lys Asp Arg Glu 



485 490 495 



tea gat att tta gat tat tta aaa aat aaa gaa ttt gat tta aaa eta 
Ser Asp lie Leu Asp Tyr Leu Lys Asn Lys Glu Phe Asp Leu Lys Leu 
500 505 510 



1652 



ttg tea ate cat cat aat ggt agg tat gat tat ctg caa gaa gaa get 1700 
Leu Ser He His His Asn Gly Arg Tyr Asp Tyr Leu Gin Glu Glu Ala 
515 520 525 

ctg aaa gta cca aca aag gga get ttt age aga tta ctt taatcgtatg 1749 
Leu Lys Val Pro Thr Lys Gly Ala Phe Ser Arg Leu Leu 
530 535 540 

gaatttgeca taggccgtgt ggaagegcac gttttagctc actttgatta tggttttcgt 1809 

aagttaaact tagatgtaga agatttaaaa ccgtttgaaa cgcaattgaa gegcatttte 1869 

ataaagatgt tatctaaggg gttagctttt gaactaaata ccaaatccct ttatctatat 1929 

gggaatgaaa aactttatcg etatgettta gagatactca aacagcttgg ttgtaaacaa 1989 

tactctatag getctgaegg tcatattcct gaacattttt gttatgaatt tgatagactt 2049 

caaggtctgc taaaggacta tcaaattgat gaaaatcatt tgatatgagg aaatttttga 2109 

taaaaaagct aggcaatatt gcttagcttt tttgtaatgc tattgatagt tttagtgaaa 2169 
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atttcaaaaa aataaagaaa tcatttactt gttgcaagcg cttgcgtaaa ttgttatgat 2229 
tttattggta acaattcatt aaaaaaggag aatgat atg aaa aga aaa gac tta 2283 

Met Lys Arg Lys Asp Leu 
545 

ttt ggt gat aaa caa act caa tac acg att aga aag tta agt gtt gga 2331 
Phe Gly Asp Lys Gin Thr Gin Tyr Thr lie Arg Lys Leu Ser Val Gly 
550 555 " 560 

gta get tea gtt aca aca ggg gta tgt att ttt ctt cat agt cca cag 2379 
Val Ala Ser Val Thr Thr Gly Val Cys lie Phe Leu His Ser Pro Gin 
565 570 575 

gta ttt get gaa gaa gta agt gtt tct cct gca act aca gcg att gca 2427 
Val Phe Ala Glu Glu Val Ser Val Ser Pro Ala Thr Thr Ala He Ala 
580 585 590 595 

gag teg aat att aat cag gtt gac aac caa caa tct act aat tta aaa 2475 
Glu Ser Asn He Asn Gin Val Asp Asn Gin Gin Ser Thr Asn Leu Lys 
600 605 610 

gat gac ata aac tea aac tct gag acg gtt gtg aca ccc tea gat atg 2523 
Asp Asp He Asn Ser Asn Ser Glu Thr Val Val Thr Pro Ser Asp Met 
615 620 625 

ccg gat ace aag caa tta gta tea gat gaa act gac act caa aag gga 2571 
Pro Asp Thr Lys Gin Leu Val Ser Asp Glu Thr Asp Thr Gin Lys Gly 
630 635 640 

gtg aca gag ccg gat aag gcg aca age ctg ctt gaa gaa aat aaa ggt 2619 
Val Thr Glu Pro Asp Lys Ala Thr Ser Leu Leu Glu Glu Asn Lys Gly 
645 650 655 

cct gtt tea gat aaa aat acc tta gat tta aaa gta gca cca tct aca 2667 
Pro Val Ser Asp Lys Asn Thr Leu Asp Leu Lys Val Ala Pro Ser Thr 
660 665 670 675 

ttg caa aat act ccc gac aaa act tct caa get ata ggt get cca age 2715 
Leu Gin Asn Thr Pro Asp Lys Thr Ser Gin Ala He Gly Ala Pro Ser 
680 685 690 

cct acc ttg aaa gta get aat caa get cca egg att gaa aat ggt tac 2763 
Pro Thr Leu Lys Val Ala Asn Gin Ala Pro Arg He Glu Asn Gly Tyr 
695 700 705 

ttt agg eta cat ctt aaa gaa ttg cct caa ggt cat cct gta gaa age 2811 
Phe Arg Leu His Leu Lys Glu Leu Pro Gin Gly His Pro Val Glu Ser 
710 715 720 

act gga ctt tgg ata tgg gga gat gtt gat caa ccg tct agt aat tgg 2859 
Thr Gly Leu Trp He Trp Gly Asp Val Asp Gin Pro Ser Ser Asn Trp 
725 730 735 

cca aat ggt get ate cct atg act gat get aag aaa gat gat tac ggt 2907 
Pro Asn Gly Ala He Pro Met Thr Asp Ala Lys Lys Asp Asp Tyr Gly 
740 745 750 755 
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tat tat gtt gat ttt aaa tta tct gaa aaa caa cga aaa caa ata tct 2955 
Tyr Tyr Val Asp Phe Lys Leu Ser Glu Lys Gin Arg Lys Gin lie Ser 
760 765 770 

ttt tta att aat aac aaa gca ggg aca aat tta age ggc gat cat cat 3003 
Phe Leu lie Asn Asn Lys Ala Gly Thr Asn Leu Ser Gly Asp His His 
775 780 785 

att cca tta tta cga cct gag atg aac caa gtt tgg att gat gaa aag 3051 
He Pro Leu Leu Arg Pro Glu Met Asn Gin Val Trp He Asp Glu Lys 
790 795 800 

tac ggt ata cat act tat caa ccc etc aaa gaa ggg tat gtc cgt att 3099 
Tyr Gly He His Thr Tyr Gin Pro Leu Lys Glu Gly Tyr Val Arg He 
805 810 815 

aac tat ttg agt tec tct agt aac tat gac cac tta tea gca tgg etc 3147 
Asn Tyr Leu Ser Ser Ser Ser Asn Tyr Asp His Leu Ser Ala Trp Leu 
820 825 830 835 

ttt aaa gat gtt gca ace ccy tea aca act tgg cca gat ggt agt aat 3195 
Phe Lys Asp Val Ala Thr Xaa Ser Thr Thr Trp Pro Asp Gly Ser Asn 
840 845 850 

ttt gtg aat caa gga eta tat gga agg tat att gat gta tea eta aaa 3243 
Phe Val Asn Gin Gly Leu Tyr Gly Arg Tyr He Asp Val Ser Leu Lys 
855 860 * 865 

act aac gee aaa gag att ggt ttt eta. ate tta gat gaa agt aag aca 3291 
Thr Asn Ala Lys Glu He Gly Phe Leu He Leu Asp Glu Ser Lys Thr 
870 875 " 880 

gga gat gca gtg aaa gtt caa ccc aac gac tat gtt ttt aga gat tta 3339 
Gly Asp Ala Val Lys Val Gin Pro Asn Asp Tyr Val Phe Arg Asp Leu 
885 890 895 

get aac cat aac caa att ttt gta aaa gat aag gat cca aag gtt tat 33 87 
Ala Asn His Asn Gin He Phe Val Lys Asp Lys Asp Pro Lys Val Tyr 
900 905 910 915 

aat aat cct tat tac att gat caa gtg cag eta aag gat gee caa caa 3435 
Asn Asn Pro Tyr Tyr He Asp Gin Val Gin Leu Lys Asp Ala Gin Gin 
920 925 930 

att gat tta aca agt att caa gca agt ttt aca act eta gat ggg gta 3483 
He Asp Leu Thr Ser He Gin Ala Ser Phe Thr Thr Leu Asp Gly Val 
935 940 945 

gat aaa act gaa att tta aaa gaa ttg aaa gtg act gat aaa aat caa 3531 
Asp Lys Thr Glu He Leu Lys Glu Leu Lys Val Thr Asp Lys Asn Gin 
950 955 960 

aat get ata caa att tct gat ate act etc gat act agt aaa tct ctt 3579 
Asn Ala He Gin He Ser Asp He Thr Leu Asp Thr Ser Lys Ser Leu 
965 970 * 975 



WO 99/42588 



PCT/CA99/00114 



42/63 



tta ata ate aaa ggc gac ttt aat cct aaa caa ggt cat ttc aac ata 
Leu He He Lys Gly Asp Phe Asn Pro Lys Gin Gly His Phe Asn He 
980 985 990 995 



3627 



tct tat aat ggt aac aat gtc atg aca agg caa tct tgg gaa ttt aaa 
Ser Tyr Asn Gly Asn Asn Val Met Thr Arg Gin Ser Trp Glu Phe Lys 
1000 1005 1010 



3675 



gac caa ctt tat get tat agt gga aat tta ggt gca gtt etc aat caa 
Asp Gin Leu Tyr Ala Tyr Ser Gly Asn Leu Gly Ala Val Leu Asn Gin 
1015 1020 1025 
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Asp Gly Ser Lys Val Glu Ala Ser Leu Trp Ser Pro Ser Ala Asp Ser 
1030 1035 1040 



3771 
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Val Thr Met He He Tyr Asp Lys Asp Asn Gin Asn Arg Val Val Ala 
1045 1050 1055 
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act ace ccc ctt gtg aaa aat aat aaa ggt gtt tgg cag acg ata ctt 
Thr Thr Pro Leu Val Lys Asn Asn Lys Gly Val Trp Gin Thr He Leu 
1060 1065 1070 1075 
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gat act aaa tta ggt att aaa aac tat act ggt tac tat tat ctt tac 
Asp Thr Lys Leu Gly He Lys Asn Tyr Thr Gly Tyr Tyr Tyr Leu Tyr 
1080 1085 1090 



3915 



gaa ata aaa aga ggt aag gat aag gtt aag att tta gat cct tat gca 
Glu He Lys Arg Gly Lys Asp Lys Val Lys He Leu Asp Pro Tyr Ala 
1095 HOO 1105 
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aag tea tta gca gag tgg gat agt aat act gtt aat gat gat att aaa 
Lys Ser Leu Ala Glu Trp Asp Ser Asn Thr Val Asn Asp Asp He Lys 
1110 1115 H20 
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acg get aaa gca get ttt gta aat cca agt caa ctt gga cct caa aat 
Thr Ala Lys Ala Ala Phe Val Asn Pro Ser Gin Leu Gly Pro Gin Asn 
1125 1130 H35 
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tta agt ttt get aaa att get aat ttt aaa gga aga caa gat get gtt 
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1140 1145 1150 1155 
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He Tyr Glu Ala His Val Arg Asp Phe Thr Ser Asp Arg Ser Leu Asp 
1160 1165 ~ 1170 
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Gly Lys Leu Lys Asn Gin Phe Gly Thr Phe Ala Ala Phe Ser Glu Lys 
1175 H80 H85 
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eta gat tat tta cag aaa tta gga gtt aca cac att cag ctt tta ccg 
Leu Asp Tyr Leu Gin Lys Leu Gly Val Thr His He Gin Leu Leu Pro 
1190 1195 1200 



4251 



WO 99/42588 



PCT/CA99/00114 



43/63 



gta ttg age tat ttt tat gtt aat gaa atg gat aag tea cgc tea aca 4299 
Val Leu Ser Tyr Phe Tyr Val Asn Glu Met Asp Lys Ser Arg Ser Thr 
1205 1210 1215 

get tac act tec tea gac aat aat tac aat tgg ggc tat gac cca cag 4347 
Ala Tyr Thr Ser Ser Asp Asn Asn Tyr Asn Trp Gly Tyr Asp Pro Gin 
1220 1225 1230 1235 

age tat ttt get ctt tct ggg atg tat tea gag aaa cca aaa gat cca 4395 
Ser Tyr Phe Ala Leu Ser Gly Met Tyr Ser Glu Lys Pro Lys Asp Pro 
1240 1245 1250 

tea gca cgt ate gee gaa tta aaa caa tta ata cat gat att cat aaa 4443 
Ser Ala Arg lie Ala Glu Leu Lys Gin Leu He His Asp He His Lys 
1255 1260 1265 

cgt ggc arg ggg gtt ata ctt gat gtc gtc tat aat cac act gca aaa 4491 
Arg Gly Met Gly Val He Leu Asp Val Val Tyr Asn His Thr Ala Lys 
1270 1275 1280 

act tat etc ttt gag gat ata gaa cct aat tat tat cac ttt atg aat 4539 
Thr Tyr Leu Phe Glu Asp He Glu Pro Asn Tyr Tyr His. Phe Met Asn 
1285 1290 * 1295 

gaa gat ggt tea cca aga gaa agt ttt gga ggg gga cgt tta gga ace 4587 
Glu Asp Gly Ser Pro Arg Glu Ser Phe Gly Gly Gly Arg Leu Gly Thr 
1300 1305 1310 *" 1315 

act cat gca atg agt cgt cgt gtt ttg gtt gat tec att aaa tat ctt 4635 
Thr His Ala Met Ser Arg Arg Val Leu Val Asp Ser He Lys Tyr Leu 
1320 1325 1330 

aca agt gaa ttt aaa gtt gat ggt ttc cgt ttt gat atg atg gga gat 4683 
Thr Ser Glu Phe Lys Val Asp Gly Phe Arg Phe Asp Met Met Gly Asp 
1335 1340 1345 

cat gat geg get gcg att gaa tta get tat aaa gaa get aaa get att 4731 
His Asp Ala Ala Ala He Glu Leu Ala Tyr Lys Glu Ala Lys Ala He 
1350 1355 1360 

aat cct aat atg att atg att ggt gag ggc tgg aga aca ttc caa ggc 4779 
Asn Pro Asn Met He Met He Gly Glu Gly Trp Arg Thr Phe Gin Gly 
1365 1370 1375 

gat caa ggt cag ccg gtt aaa cca get gac caa gat tgg atg aag tea 4827 
Asp Gin Gly Gin Pro Val Lys Pro Ala Asp Gin Asp Trp Met Lys Ser 
1380 1385 1390 ~ " * 1395 

acc gat aca gtt ggc gtc ttt tea gat gat att cgt aat age ttg aaa 4875 
Thr Asp Thr Val Gly Val Phe Ser Asp Asp He Arg Asn Ser Leu Lys 
1400 1405 1410 

tct ggt ttt cca aat gaa ggt act cca get ttc ate aca ggt ggc cca 4923 
Ser Gly Phe Pro Asn Glu Gly Thr Pro Ala Phe He Thr Gly Gly Pro 
1415 1420 1425 
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caa tct tta caa ggt att ttt aaa aat ate aaa gca caa cct ggg aat 4971 
Gin Ser Leu Gin Gly lie Phe Lys Asn He Lys Ala Gin Pro Gly Asn 
1430 1435 1440 

ttt gaa gca gat teg cca gga gat gtg gtg cag tat att get gca cat 5019 
Phe Glu Ala Asp Ser Pro Gly Asp Val Val Gin Tyr He Ala Ala His 
1445 1450 1455 

gat aac ctt ace ttg cat gat gtg att gca aaa tea att 5058 
Asp Asn Leu Thr Leu His Asp Val He Ala Lys Ser He 
1460 1465 1470 
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860 


Phe 


Pro 


Asn 


Glu 






875 




Leu 


Gin 


Gly 


He 




890 







Val 


Met 


Thr 


Arg 


Ser 


Gly 


Asn 


Leu 








480 


Ala 


Ser 


Leu 


Trp 






495 




Asp 


Lys 


Asp 


Asn 




510 






Asn 


Asn 


Lys 


Gly 


525 








Lys 


Asn 


Tyr 


Thr 


Asp 


Lys 


Val 


Lys 








560 


Asp 


Ser 


Asn 


Thr 






575 




Val 


Asn 


Pro 


Ser 




590 






Ala 


Asn 


Phe 


Lys 


605 








Arg 


Asp 


Phe 


Thr 


Phe 


Gly 


Thr 


Phe 








640 


Leu 


Gly 


Val 


Thr 






655 




Val 


Asn 


Glu 


Met 




670 






Asn 


Asn 


Tyr 


Asn 


685 








Gly 


Met 


Tyr 


Ser 


Leu 


Lys 


Gin 


Leu 








720 


Leu 


Asp 


Val 


Val 






735 




lie 


Glu 


Pro 


Asn 




750 






Glu 


Ser 


Phe 


Gly 


765 








Arg 


Val 


Leu 


Val 


Asp 


Gly 


Pne 


Arg 








800 


Glu 


Leu 


Ala 


Tyr 






815 




He 


Gly 


Glu 


Gly 




830 






Lys 


Pro 


Ala 


Asp 


845 








Phe 


Ser 


Asp 


Asp 


Gly 


Thr 


Pro 


Ala 








880 


Phe 


Lys 


Asn 


He 






895 
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Lys Ala Gin Pro Gly Asn Phe Glu Ala Asp Ser Pro Gly Asp Val Val 

900 905 910 

Gin Tyr He Ala Ala His Asp Asn Leu Thr Leu His Asp Val He Ala 

915 920 925 

Lys Ser He 
930 

<210> 27 

<211> 5607 
<212> DNA 

<213> streptococcus 

<220> 

<221> CDS 

<222> (2) . . . (301) 

<400> 27 

a att caa agt ttg aca gaa ggt caa ctt cgt tct gat ate cct gag ttc 4 9 

He Gin Ser Leu Thr Glu Gly Gin Leu Arg Ser Asp He Pro Glu Phe 
15 10 15 

cgt get ggt gat act gta cgt gtt cac get aaa gtt gtt gaa ggt act 97 
Arg Ala Gly Asp Thr Val Arg Val His Ala Lys Val Val Glu Gly Thr 
20 25 30 

cgc gaa cgt att cag ate ttt gaa ggt gtt gtt ate tea cgt aaa ggt 145 
Arg Glu Arg He Gin He Phe Glu Gly Val Val He Ser Arg Lys Gly 
35 40 45 

caa gga ate tea gaa atg tac aca gta cgt aaa att tct ggt ggt ate 193 
Gin Gly He Ser Glu Met Tyr Thr Val Arg Lys He Ser Gly Gly He 
50 55 60 

ggt gta gag cgt aca ttc cca att cac act cct cgt gtt gat aaa ate 241 
Gly Val Glu Arg Thr Phe Pro He His Thr Pro Arg Val Asp Lys He 
65 70 75 ~ ' 80 

gaa gtt gtt cgt tat ggt aaa gta cgt cgt get aaa ctt tac tac tta 289 
Glu Val Val Arg Tyr Gly Lys Val Arg Arg Ala Lys Leu Tyr Tyr Leu 
85 90 ~ 95 

cgc gca ttg caa ggtaaagctg caegtattaa agaaatccgt cgttaatttt 341 
Arg Ala Leu Gin 
100 



gatgatcaga ttttaaaaat gcttggttgt ttgaggatag taactatgtt ttaaaactgg 401 

acaaccaaga cgtaaaaaat ctgcctgtgg gcagtttttt tactaggtcc ccttagttca 461 

atggatataa caactccctc ctaaggagta attgctggtt cgattccggc aggggacata 521 

ttcattgeat gtaaatagcg gtttagagct attttgcccc aaatttctct gattaagttt 581 

ategttccta tctttttgtt cttgtaattg atgtgcgtaa acttctaaag tgatatttaa 641 

attctcgtga tctaaaactt gagagatgga aattagatag ettgeaaatg tatgectgag 701 

agagtgeact cgtacctcgc gaccagttat tttteggata gttttattga ctgeattatt 761 

tgaaagtttg tcgaataatc tgtcgttttt attttttgta aattcatgea aaaaaaataa 821 

tgtatcattg tcaattggta tatttctgat actacttttg ttttttgttg gcaggtatct 881 

ttggttgaaa tgataatccc aagttttatt aattgataaa tatttgttag tgtaatcaat 941 

atcattaact gttaaaccta aacattcagc gaagcgcatg ccagttttag cgatgaggta 1001 
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taacgctgca tacgattgat gttgtgattt ttctttacaa atttttatca agcgtaagta 1061 

ttcattggtt tcaagaaatt ttatctctat ttacgcccct tattttttgc tttaacctta 1121 

gtgaataaac aaaaattttt ttctatatat ccctcgtgaa cagccatgga tacgcaggct 1181 

tttacatgta tgttaaaacg ctttactgta tcttgcacat gcgtttgact ataatgattt 1241 

atgacttgtt gatatttagt ggaagtaata ttgcaaagta atatatttcc tattatatgt 1301 

ttatacgata ttcgatattc ccacccgttg tcgcgtttac ggaaatacgc cattgatata 1361 

ctccacatta gctaaagaac agggtgttca aggctacctt gatggaaaag gctctcttag 1421 

agatatttgt aaatggtatg atatctcaag tcgctctgtt ctccaaaagt ggataaaacg 1481 

gtatactagt ggtgaagact tgaaagccac tagtagagga tatagccgta tgaaacaagg 1541 

aaggcaagcc acatttgaag aacgtgtaga gattgttaac tacaccattg cccatgggaa 1601 

agactatcaa gcagctattg agaagtttgg tgtttcctac caacaaattt attcttgggt 1661 

gcgtaagctt gagaagaatg gctcacaagg tttggttgat agacgtgtga aagggttgga 1721 

gagtaggcct gatttaaccg agattgagca actttaactc aagattaaac aattggagga 1781 

acgtaatcgt ctcttagaaa tcgaggttag tttactaaaa aagttagaag acatcaaacg 1841 

aggaaacaga cggtaagact aggtaagcat ttagcggagt tccaagtaat caagaattat 1901 

tacgatgagg aatctaatgt gcctattcag gccttatgcc aactcttgaa ggggtctcgt 1961 

tcaggctatt acaagtggct caatcgtcaa aaaacagatt ttgagacaaa aaatacaaag 2021 

ctaatggcta aaatcaagga acttcgtaga ctctacaatg gtatcttagg ttatcgccgt 2081 

atgacaacat ttattaatcg tcaacttggg acaacttaaa acaagaaacg gattcgttga 2141 

ttgatgaaca ttctggggat tagttcagtc attcgtcgtg ttagccatgc ttgtacaaaa 2201 

gctggtgaca gattttacga agaaaatatt cttaatcgtg aatttacagc cacagctcat 2261 

aaccagaaat ggtgcacaga tgtcacctat cttcaatacg gtctgggagc taaagcttat 2321 

ctcagtgcga ttaaagacct gtataacggt tctattatcg cttatgagat tagtcacaac 23 81 

aatgaaatcc acttgttatg aagaccatta aaaaggggct agagctcaat ccaggagcca 2441 

cacctatcat ccatagcgat tgaggtagtc aatatacttc caaagaatac cgttatatca 2501 

tacaacaagc tggtctgacc ttatccatgt cccggattgg caaatgtatt gataatgcac 2561 

caactgaaag tttctttggg tttttcaaga ctgagtctta ccaccttaag aaatacaact 2621 

cttatgatga gttggtcaat gatgtggcac gttatatcga attctacaac acacaacgtt 2681 

atcaatcaaa attaaacaac ctgactcctc tagaattcag gaatcaggtt gcataactta 2741 

tcttttatta tttgactgtc tacttgacag ggagccgttc agattgctta acctttctaa 2801 

atttgctaaa atagctacaa gaaaacgagc catttaatgc ttatttctta tactgtcttg 2861 

cctcacgctc tcctcgacca aaaattgagc gtgaggcttt ttgtttcatt aaacgatgat 2921 

atttccatat tcatcagttt gttttccgag agccatcaaa gcttcgataa ggtcgataat 2981 

tccaggaata aaggtaatac taaaaataat atataaaaaa acctggccta tttttcctgc 3041 

gtaaaattta tgcgctccaa tgccgcccaa aagaacgtta ataaaacata aactactatg 3101 

ttagcataag actttatttt tacaactgaa tttcatataa atggattaga gtaagggata 3161 

aaagaaatta gcatagctct tttgaaaata aaaaaattaa tataatatgg aaaaaatttt 3221 

atttcataaa cgtttcataa aaggtatgta atctagtatt taggcaacac tattttgtca 3281 

ctggtgtcta gtaacttata gattgataat tttactagta aacgtaattc ttcgctttaa 3341 

gagttaaatg tctatttatt gtaagctaaa ttgggaggtg aacttatgta aaattagata 3401 

ggtactgtca agtacgggat gattattgaa acagccagta tgcatcataa aatctgtatt 3461 

gcttaataac tatttcctta accagacatc agttcattgt ttatcatcgc taccctaagt 3521 

ctagtttttt caatagagca ttaggtagtt tttgataata aaactatata aacatgagaa 3581 

ttagatttcg tattgcattc ttcataatga gttatttgag attttccttt gaataaatag 3641 

atacgaaatt cagtaacttc atatataaac ggctctatca ttgagatagt ttgtcaaatg 3701 

aagaaatttt taatggaaat agttttaaaa acattagttg taggcgatgt aaaaatatta 3761 

atccagtgga tgcaatagtt gcggagtaaa aatagagagg agtaattagg aagtgataaa 3821 

aaatgctata gcatatatta ccagaaaaaa aaatagaaca cttattatat ttgctatttt 3881 

aacaattgtt ctttcttgct tgtattcatg tttaacaata atgaaatcaa gtaatgaaat 3941 

agaaaaggct ttatatgaaa gttctaattc ttcaatatca attacaaaaa aagatggtaa 4001 

atattttaat attaatcaat ttaagaatat tgaaaaaata aaagaggttg aagaaaaaat 4061 

atttcaatat gatggattag caaaattgaa agatcttaaa gtagttagtg gtgagcaaag 4121 

tataaataga gaagatttat ctgacgaatt taaaaatgtt gtttcactag aagctacaag 4181 

taatactaaa agaaatcttt tatttagtag tggagtattt agttttaaag aaggaaaaaa 4241 

tatagaagaa aatgataaga attcaattct tgttcatgaa gaatttgcta aacaaaacaa 4301 

actaaaattg ggtgatgaaa ttgatcttga attactagat acggaaaaaa gtggaaaaat 4361 
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aaaaagtcat aaatttaaaa ttataggaat cttttctggt aaaaaacagg aaacatatac 4421 

aggattatca tctgatttta gcgaaaatat ggtttttgta gattattcaa ctagccaaga 4481 

aatattaaat aaatcagaga ataatagaat tgcaaataaa attttaatgt attctggtag 4541 

tttagaatct acagagcttg ccttaaacaa attgaaagac tttaaaattg ataagtcaaa 4601 

gtattctatt aagaaagata ataaagcatt cgaagagtct ttagagtcag tgagtggaat 4 661 

aaaacatata attaaaataa tgacttattc gattatgtta ggtggaatag ttgttctttc 4721 

attaatcttg attctatggt taagagaaag aatttatga'a ataggtatat ttttatctat 4781 

tggaacaact aagatacaaa ttataaggca atttatattt gagttaatat tcatatcaat 4 841 

accaagtata atatcctcct tatttttagg gaatctacta ttaaaagtaa ttgtagaagg 4901 

atttattaac tcagagaact caatgatttt cggtggaagt ttaataaata aaagcagttt 4961 

tatgttaaac ataacaacac ttgcagaaag ttatttaata ttaataagta ttattgtttt 5021 

atcagttgta atggcctctt cattaatatt atttaagaaa ccacaagaaa tattatcaaa 5081 

aataagttag gagcaaataa tggatatatt agaaataaag aatgtaaatt acagttacgc 5141 

aaattctaaa gaaaaagttt tgtcaggagt aaatcaaaaa tttgaacttg gaaagtttta 5201 

tgcgatagta gggaagtcag gaacaggaaa atccacactt ctttccttac ttgcaggact 5261 

tgataaagtt caaacaggaa aaatcttgtt taagaatgaa gatatagaaa agaaaggata 5321 

tagtaatcac agaaaaaata atatatcttt ggtatttcaa aattataatt taatagatta 5381 

tttatcgccg attgaaaata ttagactagt aaataaatca gtagatgaga gtatcttgtt 5441 

cgaattaggt ttagataaaa aacaaataaa aagaaatgtt atgaaattat ctggtggtca 5501 

gcaacaaagg gtagctattg ctagggcact ggtatcagat gccccaataa tactagctga 5561 

tgagcctacc ggtaacccag acagtgttac tgctggagaa ataatt 5607 

<210> 28 

<211> 111 
<212> PRT 

<213> streptococcus 
<400> 28 



lie 


Gin 


Ser 


Leu 


Thr 


Glu Gly Gin 


Leu 


Arg 


Ser 


Asp 


He 


Pro Glu Phe 


1 








5 






10 








15 


Arg 


Ala 


Gly 


Asp 


Thr 


Val Arg Val 


His 


Ala- 


Lys 


Val 


Val Glu Gly Thr 








20 






25 










30 


Arg 


Glu 


Arg 


He 


Gin 


He Phe Glu 


Gly 


Val 


Val 


He 


Ser Arg Lys Gly 






35 






40 










45 




Gin 


Gly 


He 


Ser 


Glu 


Met Tyr Thr 


Val 


Arg 


Lys 


He 


Ser Gly Gly He 




50 








55 








60 






Gly 


Val 


Glu 


Arg 


Thr 


Phe Pro lie 


His 


Thr 


Pro 


Arg 


Val Asp Lys He 


65 










70 






75 






80 


Glu 


Val 


Val 


Arg 


Tyr 


Gly Lys Val 


Arg 


Arg 


Ala 


Lys 


Leu 


Tyr Tyr Leu 










85 






90 








95 


Arg 


Ala 


Leu 


Gin 


Gly 


Lys Ala Ala 


Arg 


He 


Lys 


Glu 


He 


Arg Arg 








100 






105 










110 




<210> 


29 




















<211> 


173 




















<212> 


PRT 




















<213> 


streptococcus 
















<400> 


29 


















Met 


Arg 


Phe 


Ala Glu Cys Leu Gly 


Leu 


Thr 


Val 


Asn 


Asp 


He Asp Tyr 


1 








5 






10 








15 


Thr 


Asn 


Lys 


Tyr 


Leu 


Ser He Asn 


Lys 


Thr 


Trp 


Asp 


Tyr 


His Phe Asn 








20 






25 










30 


Gin 


Arg Tyr 


Leu 


Pro 


Thr Lys Asn 


Lys 


Ser 


Ser 


He 


Arg Asn He Pro 






35 






40 










45 




He 


Asp Asn 


Asp Thr Leu Phe Phe 


Leu 


His 


Glu 


Phe 


Thr Lys Asn Lys 
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50 


55 




Asn Asd fira Leu php 




Leu 


65 


70 




He Arg Lys He Thr 


Gly Arg 


Glu 


85 






Thr Phe Ala Ser Tyr 


Leu He 


Ser 


100 






Asn Leu Asn He Thr 


Leu Glu 


Val 


115 




120 


Lys Asp Arg Asn Asp 


Lys Leu 


Asn 


130 


135 




Ser Lys Pro Leu Phe 


Thr Cys 


Asn 


145 


150 




Thr Ser Asn Tyr Ser 


Leu Gly 


Gly 



165 



60 

Ser Asn Asn Ala Val Asn Lys Thr 

75 ' 80 

Val Arg Val His Ser Leu Arg His 

90 95 
He Ser Gin Val Leu Asp His Glu 
105 110 
Tyr Ala His Gin Leu Gin Glu Gin 
125 

Gin Arg Asn Leu Gly Gin Asn Ser 
140 

Glu Tyr Val Pro Cys Arg Asn Arg 
155 160 
Ser Cys Tyr He His 
170 



<210> 30 

<211> 389 

<212> PRT 

<213> streptococcus 



<400> 30 



Met 


Lys 


Ser 


Ser 


Asn 


Glu 


He 


Glu Lys Ala Leu Tyr 


Glu Ser Ser* A<5n 

w _i_ la u 1. 1 uci noil 


1 








5 






10 


15 


Ser 


Ser 


He 


Ser 


He 


Thr 


Lys 


Lys Asp Gly Lys Tyr 


Phe Asn He Asn 








20 








25 


30 


Gin 


Phe 


Lys 


Asn 


He 


Glu 


Lys 


He Lys Glu Val Glu 


Glu Lys He Phe 






35 










40 


45 


Gin Tyr 


Asp Gly 


Leu 


Ala 


Lys 


Leu Lys Asp Leu Lys 


Val Val Ser Gly 




50 










55 


60 


Glu 


Gin 


Ser 


He 


Asn 


Arg 


Glu 


Asp Leu Ser Asp Glu 


Phe Lys Asn Val 


65 










70 




75 


80 


Val 


Ser 


Leu 


Glu 


Ala 


Thr 


Ser 


Asn Thr Lys Arg Asn 


Leu Leu Phe Ser 










85 






90 


95 


Ser Gly 


Val 


Phe 


Ser 


Phe 


Lys 


Glu Gly Lys Asn He 


Glu Glu Asn Asp 








100 








105 


110 


Lys Asn 


Ser 


He 


Leu 


Val 


His 


Glu Glu Phe Ala Lys 


Gin Asn Lys Leu 






115 










120 


125 


Lys 


Leu 


Gly Asp 


Glu 


He 


Asp 


Leu Glu Leu Leu Asp 


Thr Glu Lys Ser 




130 










135 


140 


Gly Lys 


He 


Lys 


Ser 


His 


Lys 


Phe Lys He He Gly 


He Phe Ser Gly 


145 










150 




155 


160 


Lys 


Lys 


Gin 


Glu 


Thr 


Tyr 


Thr 


Gly Leu Ser Ser Asp 


Phe Ser Glu Asn 










165 






170 


175 


Met 


Val 


Phe 


Val 


Asp 


Tyr 


Ser 


Thr Ser Gin Glu He 


Leu Asn Lys Ser 








180 








185 


190 


Glu 


Asn 


Asn 


Arg 


He 


Ala 


Asn 


Lys He Leu Met Tyr 


Ser Gly Ser Leu 






195 










200 


205 


Glu 


Ser 


Thr 


Glu 


Leu 


Ala 


Leu 


Asn Lys Leu Lys Asp 


Phe Lys He Asp 




210 










215 


220 


Lys 


Ser 


Lys Tyr 


Ser 


He 


Lys 


Lys Asp Asn Lys Ala 


Phe Glu Glu Ser 


225 










230 




235 


240 


Leu 


Glu 


Ser 


Val 


Ser 


Gly 


He 


Lys His He He Lys 


lie Met Thr Tyr 










245 






250 


255 


Ser 


He 


Met 


Leu 


Gly 


Gly 


He 


Val Val Leu Ser Leu 


He Leu He Leu 








260 








265 


270 
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Trp Leu Arg 
275 

Thr Thr Lys 

290 
lie Ser He 
305 

Leu Lys Val 

Phe Gly Gly 

Thr Leu Ala 
355 

Val Val Met 

370 
Leu Ser Lys 
385 



Glu Arg lie Tyr 
He Gin 
Pro Ser 



He Val 
325 
Ser Leu 
340 

Glu Ser 
Ala Ser 
He Ser 



He He 
295 
He He 
310 

Glu Gly 



He Asn 

Tyr Leu 

Ser Leu 
375 



Glu He 
280 

Arg Gin 

Ser Ser 

Phe He 

Lys Ser 
345 
He Leu 
360 

He Leu 



Gly He 

Phe He 

Leu Phe 
315 
Asn Ser 
330 

Ser Phe 
He Ser 
Phe Lys 



Phe Leu 
285 
Phe Glu 
300 

Leu Gly 

Glu Asn 

Met Leu 

He He 
365 
Lys Pro 
380 



Ser He Gly 

Leu He Phe 

Asn Leu Leu 
320 

Ser Met He 

335 
Asn He Thr 
350 

Val Leu Ser 
Gin Glu He 



<210> 31 

<211> 169 

<212> PRT 

<213> streptococcus 



<400> 31 



Met 


Asp 


He 


Leu 


Glu 


He 


Lys 


Asn 


1 








5 








Lys 


Glu 


Lys 


Val 


Leu 


Ser 


Gly 


Val 








20 










Phe 


Tyr 


Ala 


He 


Val 


Gly 


Lys 


Ser 






35 










40 


Ser 


Leu 


Leu 


Ala 


Gly 


Leu 


Asp 


Lys 




50 










55 




Lys 


Asn 


Glu 


Asp 


He 


Glu 


Lys 


Lys 


65 










70 






Asn 


He 


Ser 


Leu 


Val 


Phe 


Gin 


Asn 










85 








Pro 


He 


Glu 


Asn 


lie 


Arg 


Leu 


Val 








100 










Leu 


Phe 


Glu 


Leu 


Gly 


Leu 


Asp 


Lys 






115 










120 


Lys 


Leu 


Ser 


Gly 


Gly 


Gin 


Gin 


Gin 




130 










135 




Val 


Ser 


Asp 


Ala 


Pro 


He 


lie 


Leu 


145 










150 






Asp 


Ser 


Val 


Thr 


Ala 


Gly 


Glu 


lie 



165 



Val 


Asn 
10 


Tyr 


Ser 


Tyr 


Ala 


Asn 
15 


Ser 


Asn 


Gin 


Lys 


Phe 


Glu 


Leu 


Gly 


Lys 


25 










30 






Gly 


Thr 


Gly 


Lys 


Ser 
45 


Thr 


Leu 


Leu 


Val 


Gin 


Thr 


Gly 
60 


Lys 


lie 


Leu 


Phe 


Gly 


Tyr 


Ser 
75 


Asn 


His 


Arg 


Lys 


Asn 
80 


Tyr 


Asn 
90 


Leu 


lie 


Asp 


Tyr 


Leu 
95 


Ser 


Asn 


Lys 


Ser 


Val 


Asp 


Glu 


Ser 


lie 


105 










110 






Lys 


Gin 


lie 


Lys 


Arg 
125 


Asn 


Val 


Met 


Arg 


Val 


Ala 


lie 
14 0 


Ala 


Arg 


Ala 


Leu 


Ala 


Asp 


Glu 
155 


Pro 


Thr 


Gly 


Asn 


Leu 
160 



lie 



<210> 32 

<211> 4171 

<212> DNA 

<213> Streptococcus 



<400> 32 

catatgacaa tatttttcaa agtctacatc acttactcgc ctgtcgtgga aaatctggca 60 

atacattaat cgaccaatta gttgctgatg gtttacttca tgcagataat cactaccatt 120 

ttttcaatgg gaagtctctg gccactttca atactaacca attgattcgc gaagttgtct 180 

atgttgaaat atccttagat actatgtcta gtggtgaaca tgatttagta aaagttaaca 240 



WO 99/42588 



PCT/CA99/00114 



53/63 



ttatcagacc cactaccgag catactatcc ccacgatgat gacagctagc ccctatcatc 300 

aaggtatcaa tgatcctgcc gcagaccaaa aaacatacca aatggagggt gcgctagcag 360 

ttaaacagcc taaacacata caagttgaca caaaaccatt taaagaagaa gtaaaacatc 420 

cttcaaaatt acccatcagc cctgcaactg aaagcttcac acacattgac agttatagtc 480 

tcaatgacta ttttctttct cgtggttttg ctaatatata cgtttcaggt gtgggtactg 540 

ctggctctac gggtttcatg accagtgggg attaccaaca aatacaaagc tttaaagcag 600 

tcattgattg gttaaatggt aaggttactg cattcacaag tcataaacga gataaacaag 660 

tcaaggctga ttggtcaaac ggccttgtag caaccacagg taaatcttat ctcggtacca 720 

tgtcaactgg tttagcaaca actggcgttg aggggctgaa agtcattatc gctgaagccg 780 

caatctccac atggtatgat tattatcgag aaaatgggct tgtgtgtagt ccaggcggct 840 

accccggtga agatttagac gttttaacag aattaacata ctcacgaaac ctcttagctg 900 

gtgattacat caaaaacaac gattgctatc aagcattgtt aaatgaacaa tcaaaagcaa 960 

ttgaccgtca aagtggggat tacaaccaat actggcatga ccgtaattac ctaactcacg 1020 

tcaataatgt caaaagtcga gtagtttaca ctcatggact acaggattgg aatgttaagc 1080 

caagacatgt ctacaaagtt ttcaatgcat tgcctcaaac catcaaaaaa cacctttttt 1140 

tacatcaagg tcaacatgtg tatatgcata attggcagtc gattgatttt cgtgaaagca 1200 

tgaatgcctt actaagccaa gaactacttg gcattgacaa tcatttccaa ttagaagagg 1260 

tcatttggca agataatact actgagcaaa cttggcaagt tttagatgct ttcggaggaa 1320 

accatcaaga gcaaattggt ttaggtgata gtaaaaaact tattgataac cattatgaca 1380 

aagaagcctt tgatacttat tgtaaagact tcaatgtgtt caaaaatgat cttttcaagg 1440 

gaaataataa aaccaatcaa atcactatta atcttcctct aaagaaaaat tatctcctga 1500 

atggacagtg caaactccat ctacgtgtta aaactagtga caaaaaggcc attttatcag 1560 

cccaaatctt agactatggt cctaaaaaac gattcaaaga tacaccaacc atcaaattct 1620 

taaacagcct tgataatggt aaaaattttg ccagagaagc tttacgtgaa ctcccgttta 1680 

ctaaagatca ttatcgtgtc atcagtaaag gtgtcttgaa ccttcaaaat cgtacagact 1740 

tacttacaat tgaggctatc gagccagaac aatggtttga tatcgagttt agcctccaac 1800 

caagtatata tcaattgagt aaaggtgata atctaaggat tatcctttat acaactgatt 1860 

ttgaacatac cattcgagat aatgctagtt actctataac agtagatttg agtcaatctt 1920 

atttaactat cccaactaat caaggaaatt aacttatgaa acttcttact aaagaacggt 1980 

ttgatgattc tcaacacttt tggtaccaga tcaatttatt acaagagagt aacttcggag 2040 

cagtttttga ccatgataat aaaaacattc cacaggttgt tgcaactatt gttgatgatt 2100 

tacaaggttc cggaagttcg aatcatttct ggtattttgg caatactact gatacttcca 2160 

tccttatgat tgctcattta aatcgaaaat tctatattca ggttaattta aaggactttg 2220 

actttgcact caatttaata gctataaata attggaagag tctcctccaa actcaacttg 22B0 

aagctctaaa cgatacccta gcaatatttc aataaataag gtagaatgga gtgacaaagc 2340 

aacgcgaggg agactgatta atgtcatctt attggaataa ctatcctgaa cttaaaaaaa 2400 

atattgatga aaccaatcaa ctaattcaag aaagaataca ggtcagaaat aaagatattg 2460 

aagcggcgct aagccaactc acagctgcgg gaggaaaaca gctcagacca gcattctttt 2520 

accttttttc tcaacttggt aataaggaga atcaagatac tcagcaacta aagaaaatcg 2580 

ctgcttcttt agaaatcctt cacgttgcta cattaatcca tgatgatgtc attgatgact 2640 

caccactaag acgtggaaat atgaccattc aaagcaagtt tggcaaagac atcgcagttt 2700 

atactgggga tttacttttc acagtctttt tcgatcttat tttagaatct atgactgata 2760 

caccatttat gaggattaat gcaaaatcta tgcgtaaaat tctcatggga gaattggacc 2820 

agatgcacct tcgttacaat caacaacaag gtatccatca ctatttacgt gcgatttcag 2880 

gtaagacagc cgaactcttt aaattagcta gcaaagaagg agcttacttt ggtggtgcag 2940 

agaaggaggt tgttcgtcta gcaggccata tcggctttaa cattggtatg acattccaaa 3000 

ttttggatga tatcctggat tatactgcag ataaaaaaac atttaataag cctgtcttag 3060 

aggatttaac acaaggcgtt tacagccttc ctctacttct tgccattgaa gaaaatcctg 3120 

atattttcaa acctatttta gataaaaaaa cagatatggc tactgaagac atggaaaaaa 3180 

ttgcttatct cgtcgtttcc catagaggtg ttgacaaagc tcgccatcta gctcgtaaat 3240 

ttactgagaa agctattagt gacataaata agctacccca gaactctgca aaaaaacagt 3300 

tgctacaatt aactaattac cttttaaaac gcaaaattta aataataaaa aaacattcca 3360 

caatgctaga aaagcagtta gggaatgttt ttttattatc atttatttat cgcacctatc 3420 

aatcatcata gatcaccatc atcagcggct ttcagctgac ggtaacgttg actactttga 3480 

gacaattctt gaggagaacc ttccaactct aattgcccat tttctataaa taagatacga 3540 

tcagcatgtt caataccttt taagtgatgt gtaatccaaa ctaaggtctt accttccaat 3600 
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tctttcataa atacccttag taaggcttgt tcagtaatag gatcaagtcc aacagttggc 3660 

tcatctaaga taacaattgg gacatctttt agtaagattc tagccaaagc aattctatgc 3720 

ctttcgccac ctgaaaacct aagtccagct tcatcaacca ttgtatagag accatctgat 3780 

aaatcagtga ccatctcttt caatccaact cgttcaagaa ctttccatac atcttcttca 3840 

ctagcatctt ggtttccaat gcgaatgtta tttagcaggg ttgtattaaa aaggtagggc 3900 

gcttgttgta tcactccaat atagttagaa atgcaatcac caactattga aacatcagca 3960 

ccgcctaggg taatcttccc ttgacttgct ttcaagtcgc cacgaagtag actagctaag 4020 

gtactcttgc cagaaccact ccgccctaaa atagcaattt tttctccttc tttaatatcc 4080 

aaatctaaat gatgcaaaac ccatttctct tgtggcttat actggaaact taaattcttg 4140 

acggaaaaat catatggctt attaggcaat t 4171 
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175 




Pro He 


Thr 


Glu 


Gin 


Ala 


Leu 


Leu Arg Val 


Phe 


Met 


Lys 


Glu 


Leu 


Glu 






180 








185 






190 






Gly Lys 


Thr 


Leu 


Val 


Trp 


He 


Thr His His 


Leu 


Lys 


Gly 


He 


Glu 


His 




195 










200 




205 








Ala Asp 


Arg 


He 


Leu 


Phe 


lie 


Glu Asn Gly 


Gin 


Leu 


Glu 


Leu 


Glu 


Gly 


210 










215 






220 








Ser Pro 


Gin 


Glu 


Leu 


Ser 


Gin 


Ser Ser Gin 


Arg 


Tyr 


Arg 


Gin 


Leu 


Lys 


225 








230 






235 










240 


Ala Ala 


Asp 


Asp 


Gly 


Asp 


Leu 






















245 





















<210> 37 
<211> 3480 
<212> DNA 

<213> Streptococcus 
<400> 37 

aattctattt ggaggttttt cttgaataaa tggttagtta aggcaagttc cttagttgtt 60 
ttaggtggta tggttttatc tgcgggttcc cgagttttag cggatactta tgtccgtcca 120 
attgataatg gtagaattac aacaggtttc aatggttatc ctggacattg tggggtggat 180 
tatgctgttc cgactggaac gattattagg gcagtggcag atggtactgt gaaatttgca 240 
ggagctggag ccaacttttc ttggatgaca gacttagcag gaaattgtgt catgattcaa 300 
catgcggatg gaatgcatag tggttacgct catatgtcac gtgtggtggc taggactggg 360 
gaaaaagtca aacaaggaga tatcatcggt tacgtaggag caactggtat ggcgacggga 420 



WO 99/42588 



PCT/CA99/00114 



58/63 



cctcaccttc attttgaatt tttaccagct aaccctaatt ttcaaaatgg tttccatgga 480 

cgtatcaatc caacgtcact aattgctaac gttgcgacct ttagtggaaa aacgcaagca 540 

tcagctccaa gcattaagcc attacaatca gctcctgtac agaatcaatc tagtaaatta 600 

aaagtgtatc gagtagatga attacaaaag gttaatggtg tttggttagt caaaaataac 660 

accctaacgc cgactgggtt tgattggaac gataatggta taccagcatc agaaattgat 720 

gaggttgatg ctaatggtaa tttgacagct gaccaggttc ttcaaaaagg tggttacttt 780 

atctttaatc ctaaaactct taagactgta gaaaaaccca tccaaggaac agctggttta 840 

acttgggcta agacacgctt tgctaatggt agttcagttt ggcttcgcgt tgacaacagt 900 

caagaactgc tttacaaata gtttgaggta ttgattcatt gttttaaatg acagttttgt 960 

tactaactaa gtacaatttc tttaaaccgt ctgaaaataa ttttatagtc cagtaaagtg 1020 

tgatattata gtctcggact aataaaaagg aaataggaat tgaagcaatg aaaatgaata 1080 

aaaaggtact attgacatcg acaatggcag cttcgctatt atcagtcgca agtgttcaag 1140 

cacaagaaac agatacgacg tggacagcac gtactgtttc agaggtaaag gctgatttgg 1200 

taaagcaaga caataaatca tcatatactg tgaaatatgg tgatacacta agcgttattt 1260 

cagaagcaat gtcaattgat atgaatgtct tagcaaaaat taataacatt gcagatatca 1320 

atcttattta tcctgagaca acactgacag taacttacga tcagaagagt catactgcca 1380 

cttcaatgaa aatagaaaca ccagcaacaa atgctgctgg tcaaacaaca gctactgtgg 1440 

atttgaaaac caatcaagtt tctgttgcag accaaaaagt ttctctcaat acaatttcgg 1500 

aaggtatgac accagaagca gcaacaacga ttgtttcgcc aatgaagaca tattcttctg 1560 

cgccagcttt gaaatcaaaa gaagtattag cacaagagca agctgttagt caagcagcag 1620 

ctaatgaaca ggtatcaaca gctcctgtga agtcgattac ttcagaagtt ccagcagcta 1680 

aagaggaagt taaaccaact cagacgtcag tcagtcagtc aacaacagta tcaccagctt 1740 

ctgttgccgc tgaaacacca gctccagtag ctaaagtagc accggtaaga actgtagcag 1800 

cccctagagt ggcaagtgtt aaagtagtca ctcctaaagt agaaactggt gcatcaccag 1860 

agcatgtatc agctccagca gttcctgtga ctacgacttc aacagctaca gacagtaagt 1920 

tacaagcgac tgaagttaag agcgttccgg tagcacaaaa agctccaaca gcaacaccgg 1980 

tagcacaacc agcttcaaca acaaatgcag tagctgcaca tcctgaaaat gcagggctcc 2040 

aacctcatgt tgcagcttat aaagaaaaag tagcgtcaac ttatggagtt aatgaattca 2100 

gtacataccg tgcaggtgat ccaggtgatc atggtaaagg tttagcagtc gactttattg 2160 

taggtaaaaa ccaagcactt ggtaatgaag ttgcacagta ctctacacaa aatatggcag 2220 

caaataacat ttcatatgtt atctggcaac aaaagtttta ctcaaataca aatagtattt 2280 

atggacctgc taatacttgg aatgcaatgc cagatcgtgg tggcgttact gccaaccatt 2340 

atgaccatgt tcacgtatca tttaacaaat aatataaaaa aggaagctat ttggcttctt 2400 

ttttatatgc cttgaataga ctttcaaggt tcttatctaa tttttattaa attgaggaga 2460 

ttaagctata agtctgaaac tactttcacg ttaaccgtga ctaaatcaaa acgttaaaac 2520 

taaaatctaa gtctgtaaag attattgaaa acgctttaaa aacagatata ataaggtttg 2580 

tagatatcta aaattaaaaa agataaggaa gtgagaatat gccacatcta agtaaagaag 2640 

cttttaaaaa gcaaataaaa aatggcatta ttgtgtcatg tcaagctttg cctggggagc 2700 

ctctttatac tgaaagtgga ggtgttatgc ctcttttagc tttggcagct caagaagcag 2760 

gagcggttgg tataagagcc aatagtgtcc gcgacattaa ggaaattcaa gaagttacta 2820 

atttacctat catcggcatt attaaacgtg aatatcctcc acaagaacca tttatcactg 2880 

ctacgatgac agaggtggat caattagcta gtttagatat tgcagtaata gccttagatt 2940 

gtacacttag agagcgtcat gatggtttga gtgtagctga gtttattcaa aagataaaag 3000 

ggaaatatcc tgaacagttg ctaatggctg atataagtac ttttgaagaa ggtaaaaatg 3060 

cttttgaagc aggagttgat tttgtgggta caactctatc tggatacaca gattacagcc 3120 

gccaagaaga aggaccggat atagaactcc ttaataagct ttgtcaagcc ggtatagatg 3180 

tgattgcgga aggtaaaatt catactccta agcaagctaa tgaaattaat catataggtg 3240 

ttgcaggaat tgtagttggt ggtgctatca ctagaccaaa agaaatagcg gagcgtttca 3300 

tctcaggact tagttaaaag tgttactcaa aaatcaaaat caaaataaaa aaggggaata 3360 

gttatgagta tcaaaaaaag tgtgattggt ttttgcctcg gagctgcagc attatcaatg 3420 

tttgcttgtg tagacagtag tcaatctgtt atggctgccg agaaggataa agtcgaaatt 3480 

<210> 38 
<211> 306 
<212> PRT 

<213> Streptococcus 
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<400> 


38 






Asn 


Ser 


He 


Trp 


Arg Phe Phe Leu Asn Lys Trp 


Leu Val Lys Ala Ser 


1 








5 10 


15 


Ser 


Leu 


Val 


Val 


Leu Gly Gly Met Val Leu Ser 


Ala Gly Ser Arg Val 








20 


25 


30 


Leu 


Ala 


Asp 


Thr 


Tyr Val Arg Pro He Asp Asn 


Gly Arg He Thr Thr 






35 




40 


45 


Gly 


Phe 


Asn 


Gly 


Tyr Pro Gly His Cys Gly Val 


Asp Tyr Ala Val Pro 




50 






55 


60 


Thr 


Gly 


Thr 


He 


He Arg Ala Val Ala Asp Gly 


Thr Val Lys Phe Ala 


65 








70 * 75 


80 


Gly 


Ala 


Gly 


Ala 


Asn Phe Ser Trp Met Thr Asp 


Leu Ala Gly Asn Cys 










85 90 


95 


Val 


Met 


He 


Gin 


His Ala Asp Gly Met His Ser 


Gly Tyr Ala His Met 








100 


105 


110 


Ser 


Arg 


Val 


Val 


Ala Arg Thr Gly Glu Lys Val 


Lys Gin Gly Asp He 






115 




120 


125 


He 


Gly 


Tyr 


Val 


Gly Ala Thr Gly Met Ala Thr 


Gly Pro His Leu His 




130 






135 


140 


Phe 


Glu 


Phe 


Leu 


Pro Ala Asn Pro Asn Phe Gin 


Asn Gly Phe His Gly 


145 








150 155 


160 


Arg 


He 


Asn 


Pro 


Thr Ser Leu He Ala Asn Val 


Ala Thr Phe Ser Gly 










165 170 


175 


Lys 


Thr 


Gin 


Ala 


Ser Ala Pro Ser He Lys Pro 


Leu Gin Ser Ala Pro 








180 


185 


190 


Val 


Gin 


Asn 


Gin 


Ser Ser Lys Leu Lys Val Tyr 


Arg Val Asp Glu Leu 






195 




200 


205 


Gin 


Lys 


Val 


Asn 


Gly Val Trp Leu Val Lys Asn 


Asn Thr Leu Thr Pro 




210 






215 


220 


Thr 


Gly 


Phe 


Asp 


Trp Asn Asp Asn Gly He Pro 


Ala Ser Glu He Asp 


225 








230 235 


240 


Glu 


Val 


Asp 


Ala 


Asn Gly Asn Leu Thr Ala Asp 


Gin Val Leu Gin Lys 










245 250 


255 


Gly 


Gly 


Tyr 


Phe 


He Phe Asn Pro Lys Thr Leu 


Lys Thr Val Glu Lys 








260 


265 


270 


Pro 


He 


Gin 


Gly 


Thr Ala Gly Leu Thr Trp Ala 


Lys Thr Arg Phe Ala 






275 




280 


285 


Asn 


Gly 


Ser 


Ser 


Val Trp Leu Arg Val Asp Asn 


Ser Gin Glu Leu Leu 




290 






295 


300 


Tyr 


Lys 










305 













<210> 39 
<211> 434 
<212> PRT 

<213> Streptococcus 
<400> 39 

Met Lys Met Asn Lys Lys Val Leu Leu Thr Ser Thr Met Ala Ala Ser 

15 10 15 

Leu Leu Ser Val Ala Ser Val Gin Ala Gin Glu Thr Asp Thr Thr Trp 

20 25 30 

Thr Ala Arg Thr Val Ser Glu Val Lys Ala Asp Leu Val Lys Gin Asp 

35 40 45 

Asn Lys Ser Ser Tyr Thr Val Lys Tyr Gly Asp Thr Leu Ser Val He 
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50 








55 










60 










Ser 


Glu Ala 


Met 


Ser 


lie Asp 


Met 


Asn 


Val 


Leu 


Ala 


Lys 


He 


Asn 


Asn 


D O 










70 








75 








80 


lie 


Ala Asp 


He 


Asn 


Leu He 


Tyr Pro 


Glu 


Thr 


Thr 


Leu 


Thr 


Val 


Thr 










85 








90 










95 




Tyr 


Asp Gin Lys 


Ser 


His Thr 


Ala 


Thr 


Ser 


Met 


Lys 


He 


Glu 


Thr 


Pro 








100 








105 










110 






AT * 

rtla 


Thr 


Asn 


Ala 


Ala Gly Gin 


inr 


Thr 


Ala 


Thr 


Val 


Asp Leu 


Lys 


Thr 






115 








ion 










125 






Asn 


Gin 


Val 


Ser 


Val Ala Asp 


bin 


T i wn 

Lys 


val 


Ser 


Leu 


Asn 


Thr 


He 


Ser 




130 








135 










140 










uiU 


Gly Met Thr 


Pro 


Glu Ala 


Al = 

Ala 


Thr 


Thr 


He 


Val 


Ser 


Pro 


Met 


Lys 


145 










150 








155 










160 




Tyr Ser Ser 


Ala 


Pro Ala 


Leu 


Lys 


Ser 


Lys 


Glu 


Val 


Leu 


Ala 


Gin 










165 








170 










175 




vj J. U 


Gin 


Ala 


Val 
180 


Ser 


Gin Ala 


Ala 


Ala 
185 


Asn 


Glu 


Gin 


Val 


Ser 
190 


Thr 


Ala 


Pro 


Val 


Lys 
195 


Ser 


He 


Thr Ser 


Glu 

o n f\ 


Val 


Pro 


Ala 


Ala 


Lys 
205 


Glu 


Glu 


Val 


Lys 


Pro 


Thr 


Gin 


Thr 


Ser Val 


Ser 


Gin 


Ser 


Thr Thr Val 


Ser 


Pro 


Ala 




210 








215 










220 












Val 


Ala 


Ala 


Glu 


Thr Pro 


Ala 


Pro 


Val 


Ala 


Lys 


Val 


Ala 


Pro 


Val 


225 










230 








235 










240 


Arg 


Thr 


Val 


Ala 


Ala 


Pro Arg 


vai 


Ala 


Ser 


Val 


Lys Val Val 


Thr 


Pro 










245 








ZOKJ 










255 




T,vc 


Val 


Glu 


Thr 


Gly Ala Ser 


Pro 


Glu 


His 


Val 


Ser Ala 


Pro 


Ala 


Val 








260 








Zbb 










270 






Pro 


Val 


Thr 


Thr 


Thr 


Ser Thr 


Ala 


Thr 


Asp 


Ser Lys 


Leu 


Gin 


Ala 


Thr 






275 


















285 








m ii 

ui u 


Val 
290 


Lys 


Ser 


Val 


Pro Val 
295 


Til - 

Ala 


Gin 


Lys 


Ala 


Pro 
300 


Thr 


Ala 


Thr 


Pro 


Val 


Ala 


Gin 


Pro 


Ala 


Ser Thr 


inr 


Asn 


Ala 


Val 


Ala 


Ala 


His 


Pro 


Glu 


305 










310 








315 










320 


Asn 


Ala Gly Leu 


Gin 


Pro His 


Val 


Ala 


Ala 


Tyr Lys 


Glu 


Lys 


Val 


Ala 










325 








J 3 U 










335 




Ser 


Thr Tyr Gly 


Val 


Asn Glu 


Phe 


Ser 


inr 


Tyr Arg Ala Gly 


Asp 


Pro 








340 








345 










350 






Gly 


Asp His Gly 


Lys Gly Leu 


Ala 


Val 


Asp 


Phe 


He Val Gly 


Lys Asn 






355 








360 










365 








Gin 


Ala Leu Gly 


Asn 


Glu Val 


Ala 


Gin 


Tyr 


Ser 


Thr 


Gin 


Asn 


Met 


Ala 




370 








375 










380 










Ala 


Asn 


Asn 


He 


Ser 


Tyr Val 


He 


Trp 


Gin 


Gin 


Lys 


Phe 


Tyr 


Ser 


Asn 


385 










3 90 








395 








400 


Thr 


Asn 


Ser 


He 


Tyr Gly Pro 


Ala 


Asn 


Thr 


Trp Asn Ala Met 


Pro Asp 










405 








410 










415 




Arg 


Gly Gly Val 


Thr 


Ala Asn 


His 


Tyr 


Asp 


His 


Val 


His 


Val 


Ser 


Phe 








420 








425 










430 







Asn Lys 



<210> 40 

<211> 232 

<212> PRT 

<213> Streptococcus 



<400> 40 
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Met Pro His 
1 

He He Val 

Ser Gly Gly 
35 

Ala Val Gly 
50 

Glu Val Thr 
65 

Pro Gin Glu 

Ala Ser Leu 

Arg His Asp 
115 

Lys Tyr Pro 
130 

Gly Lys Asn 
145 

Ser Gly Tyr 

Leu Leu Asn 

Lys He His 
195 

Ala Gly He 

210 
Glu Arg Phe 
225 



Leu Ser Lys 
5 

Ser Cys Gin 
20 

Val Met Pro 

He Arg Ala 

Asn Leu Pro 
70 

Pro Phe He 
85 

Asp He Ala 
100 

Gly Leu Ser 

Glu Gin Leu 

Ala Phe Glu 
150 

Thr Asp Tyr 
165 

Lys Leu Cys 
180 

Thr Pro Lys 

Val Val Gly 

lie Ser Gly 
230 



Glu Ala 

Ala Leu 

Leu Leu 

40 
Asn Ser 
55 

He He 

Thr Ala 

Val lie 

Val Ala 
120 
Leu Met 
135 

Ala Gly 

Xaa Arg 

Gin Ala 

Gin Ala 
200 
Gly Ala 
215 

Leu Ser 



Phe Lys Lys 
10 

Pro Gly Glu 
25 

Ala Leu Ala 

Val Arg Asp 

Gly lie lie 
75 

Thr Met Thr 
90 

Ala Leu Asp 
105 

Glu Phe lie 

Ala Asp lie 

Val Asp Phe 
155 

Gin Glu Glu 
170 

Gly lie Asp 
185 

Asn Glu lie 
He Thr Arg 



Gin lie 

Pro Leu 

Ala Gin 

45 
lie Lys 
60 

Lys Arg 

Glu Val 

Cys Thr 

Gin Lys 
125 
Ser Thr 
140 

Val Gly 

Gly Pro 

Val He 

Asn His 
205 
Pro Lys 
220 



Lys Asn Gly 
15 

Tyr Thr Glu 
30 

Glu Ala Gly 

Glu He Gin 

Glu Tyr Pro 
80 

Asp Gin Leu 
95 

Leu Arg Glu 
110 

lie Lys Gly 

Phe Glu Glu 

Thr Thr Leu 
160 

Asp He Glu 
175 

Ala Glu Gly 
190 

lie Gly Val 
Glu He Ala 



<210> 41 

<211> 39 

<212> PRT 

<213> Streptococcus 



<400> 41 
Met Ser He Lys Lys Ser Val He 

1 5 
Leu Ser Met Phe Ala Cys Val Asp 
20 

Glu Lys Asp Lys Val Glu lie 
35 



Gly Phe Cys Leu Gly Ala Ala Ala 
10 15 
Ser Gin Ser Val Met Ala Ala 
30 



<210> 42 
<211> 1305 
<212> DNA 

<213> Streptococcus 



<400> 42 

atgaaaatga ataaaaaggt actattgaca tcgacaatgg cagcttcgct attatcagtc 60 

gcaagtgttc aagcacaaga aacagatacg acgtggacag cacgtactgt ttcagaggta 120 

aaggctgatt tggtaaagca agacaataaa tcatcatata ctgtgaaata tggtgataca 180 

ctaagcgtta tttcagaagc aatgtcaatt gatatgaatg tcttagcaaa aattaataac 240 

attgcagata tcaatcttat ttatcctgag acaacactga cagtaactta cgatcagaag 300 

agtcatactg ccacttcaat gaaaatagaa acaccagcaa caaatgctgc tggtcaaaca 360 
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acagctactg tggatttgaa aaccaatcaa gtttctgttg cagaccaaaa agtttctctc 420 

aatacaattt cggaaggtat gacaccagaa gcagcaacaa cgattgtttc gccaatgaag 480 

acatattctt ctgcgccagc tttgaaatca aaagaagtat tagcacaaga gcaagctgtt 54 0 

agtcaagcag cagctaatga acaggtatca acagctcctg tgaagtcgat tacttcagaa 600 

gttccagcag ctaaagagga agttaaacca actcagacgt cagtcagtca gtcaacaaca 660 

gtatcaccag cttctgttgc cgctgaaaca ccagctccag tagctaaagt agcaccggta 720 

agaactgtag cagcccctag agtggcaagt gttaaagtag tcactcctaa agtagaaact 780 

ggtgcatcac cagagcatgt atcagctcca gcagttcctg tgactacgac ttcaacagct 840 

acagacagta agttacaagc gactgaagtt aagagcgttc cggtagcaca aaaagctcca 900 

acagcaacac cggtagcaca accagcttca acaacaaatg cagtagctgc acatcctgaa 960 

aatgcagggc tccaacctca tgttgcagct tataaagaaa aagtagcgtc aacttatgga 1020 

gttaatgaat tcagtacata ccgtgcaggt gatccaggtg atcatggtaa aggtttagca 1080 

gtcgacttta ttgtaggtaa aaaccaagca cttggtaatg aagttgcaca gtactctaca 1140 

caaaatatgg cagcaaataa catttcatat gttatctggc aacaaaagtt ttactcaaat 1200 

acaaatagta tttatggacc tgctaatact tggaatgcaa tgccagatcg tggtggcgtt 1260 

actgccaacc attatgacca tgttcacgta tcatttaaca aataa 1305 

<210> 43 
<211> 1230 
<212> DNA 

<213> Streptococcus 

<400> 43 
caagaaacag atacgacgtg 
aagcaagaca ataaatcatc 
gaagcaatgt caattgatat 
cttatttatc ctgagacaac 
tcaatgaaaa tagaaacacc 
ttgaaaacca atcaagtttc 
ggtatgacac cagaagcagc 
ccagctttga aatcaaaaga 
aatgaacagg tatcaacagc 
gaggaagtta aaccaactca 
gttgccgctg aaacaccagc 
cctagagtgg caagtgttaa 
catgtatcag ctccagcagt 
caagcgactg aagttaagag 
gcacaaccag cttcaacaac 
cctcatgttg cagcttataa 
acataccgtg caggtgatcc 
ggtaaaaacc aagcacttgg 
aataacattt catatgttat 
ggacctgcta atacttggaa 
gaccatgttc acgtatcatt 

<210> 44 
<211> 409 
<212> PRT 

<213> Streptococcus 
<400> 44 

Gin Glu Thr Asp Thr Thr Trp Thr Ala Arg Thr Val Ser Glu Val Lys 

15 10 15 

Ala Asp Leu Val Lys Gin Asp Asn Lys Ser Ser Tyr Thr Val Lys Tyr 

20 25 30 

Gly Asp Thr Leu Ser Val He Ser Glu Ala Met Ser He Asp Met Asn 



gacagcacgt actgtttcag 
atatactgtg aaatatggtg 
gaatgtctta gcaaaaatta 
actgacagta acttacgatc 
agcaacaaat gctgctggtc 
tgttgcagac caaaaagttt 
aacaacgatt gtttcgccaa 
agtattagca caagagcaag 
tcctgtgaag tcgattactt 
gacgtcagtc agtcagtcaa 
tccagtagct aaagtagcac 
agtagtcact cctaaagtag 
tcctgtgact acgacttcaa 
cgttccggta gcacaaaaag 
aaatgcagta gctgcacatc 
agaaaaagta gcgtcaactt 
aggtgatcat ggtaaaggtt 
taatgaagtt gcacagtact 
ctggcaacaa aagttttact 
tgcaatgcca gatcgtggtg 
taacaaataa 



aggtaaaggc tgatttggta 60 

atacactaag cgttatttca 120 

ataacattgc agatatcaat 180 

agaagagtca tactgccact 240 

aaacaacagc tactgtggat 300 

ctctcaatac aatttcggaa 360 

tgaagacata ttcttctgcg 420 

ctgttagtca agcagcagct 480 

cagaagttcc agcagctaaa 540 

caacagtatc accagcttct 600 

cggtaagaac tgtagcagcc 660 

aaactggtgc atcaccagag 720 

cagctacaga cagtaagtta 780 

ctccaacagc aacaccggta 840 

ctgaaaatgc agggctccaa 900 

atggagttaa tgaattcagt 960 

tagcagtcga ctttattgta 1020 

ctacacaaaa tatggcagca 1080 

caaatacaaa tagtatttat 1140 

gcgttactgc caaccattat 1200 

1230 
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35 40 45 

Val Leu Ala Lys He Asn Asn He Ala Asp He Asn Leu He Tyr Pro 

50 55 60 

Glu Thr Thr Leu Thr Val Thr Tyr Asp Gin Lys Ser His Thr Ala Thr 
65 70 75 80 

Ser Met Lys He Glu Thr Pro Ala Thr Asn Ala Ala Gly Gin Thr Thr 

85 90 95 

Ala Thr Val Asp Leu Lys Thr Asn Gin Val Ser Val Ala Asp Gin Lys 

100 105 no 

Val Ser Leu Asn Thr He Ser Glu Gly Met Thr Pro Glu Ala Ala Thr 

H5 120 125 

Thr He Val Ser Pro Met Lys Thr Tyr Ser Ser Ala Pro Ala Leu Lys 

130 135 140 

Ser Lys Glu Val Leu Ala Gin Glu Gin Ala Val Ser Gin Ala Ala Ala 
145 150 155 160 

Asn Glu Gin Val Ser Thr Ala Pro Val Lys Ser He Thr Ser Glu Val 

165 170 175 

Pro Ala Ala Lys Glu Glu Val Lys Pro Thr Gin Thr Ser Val Ser Gin 

180 185 190 

Ser Thr Thr Val Ser Pro Ala Ser Val Ala Ala Glu Thr Pro Ala Pro 

195 200 205 

Val Ala Lys Val Ala Pro Val Arg Thr Val Ala Ala Pro Arg Val Ala 

210 215 220 

Ser Val Lys Val Val Thr Pro Lys Val Glu Thr Gly Ala Ser Pro Glu 
225 230 235 240 

His Val Ser Ala Pro Ala Val Pro Val Thr Thr Thr Ser Thr Ala Thr 

245 250 255 

Asp Ser Lys Leu Gin Ala Thr Glu Val Lys Ser Val Pro Val Ala Gin 

260 265 270 

Lys Ala Pro Thr Ala Thr Pro Val Ala Gin Pro Ala Ser Thr Thr Asn 

275 280 285 

Ala Val Ala Ala His Pro Glu Asn Ala Gly Leu Gin Pro His Val Ala 

290 295 300 

Ala Tyr Lys Glu Lys Val Ala Ser Thr Tyr Gly Val Asn Glu Phe Ser 
305 310 315 320 

Thr Tyr Arg Ala Gly Asp Pro Gly Asp His Gly Lys Gly Leu Ala Val 

325 330 ' 335 

Asp Phe He Val Gly Lys Asn Gin Ala Leu Gly Asn Glu Val Ala Gin 

340 345 ' 350 

Tyr Ser Thr Gin Asn Met Ala Ala Asn Asn He Ser Tyr Val He Trp 

355 360 365 

Gin Gin Lys Phe Tyr Ser Asn Thr Asn Ser He Tyr Gly Pro Ala Asn 

370 375 380 

Thr Trp Asn Ala Met Pro Asp Arg Gly Gly Val Thr Ala Asn His Tyr 
385 390 395 400 

Asp His Val His Val Ser Phe Asn Lys 
405 



